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Methods of Eliciting Broadly Neutralizing Antibodies 
Targeting HIV-1 gp41 



Background of the Invention 



Statement as to Rights to Inventions Made Under 
Federally-Sponsored Research and Development 

Part of the work performed during development of this invention utilized U.S. 
Government funds. The U.S. Government has certain rights in this invention pursuant to 
INNOVATION Grant No. R21 AI 42714. 

Field of the Invention 

The present invention is related to HIV therapy and prophylaxis. In particular, the 
invention relates to methods for eliciting broadly neutralizing antibodies that target entry- 
relevant structures of HIV-1 gp41. Such methods, and pharmaceutical compositions 
therefor, can be employed to inhibit HIV entry into uninfected cells. 

Related Art 

The development of effective vaccines to prevent infection with HIV remains a 
high priority goal. To date, envelope glycoproteins (gpl60 and gpl20/gp41) have been 
the main focus of vaccine research efforts. One result of this work is the observation that 
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the humoral response generated against native forms of the envelope (primarily oligomeric 
forms of the gpl20/gp41 complex) is more broadly neutralizing than antibody raised 
against denatured and/or monomeric envelope (VanCott, T. C, etaL, J. ViroL 77:4319- 
4330(1 997)) . S tructural considerations are important components for both understanding 
5 the immunogenicity of the envelope protein and the design of envelope based immunogens 
which induce a broad neutralizing response against HIV. 

A good deal of structural information is available with respect to the 
w transmembrane protein (TM or gp4 1). Predictive work indicated that several regions of 
the ectodomain of gp41 display a high propensity to exhibit certain specific types of 

10 secondary structure (Gallaher, W. R., et aL, AIDS Res. Hum. Retroviruses 5:431-440 
(1989); Delwart, E. L., et aL, AIDS Res. Hum. Retroviruses 6:703-704 (1990)). 
Experimental work employing both synthetic peptides and protein recombinants has 
established that these predictions were generally correct and recently a three dimensional 
structure for a portion of the gp41 ectodomain was reported (Wild, C, et aL, Proc. Natl. 

15 Acad. Sci. USA 59:10537-10541 (1992); Wild, C, et al t Proc. Natl Acad. Sci. USA 
97:12676^12680 (1994); Wild, C, et aL, AIDS Res. Hum. Retroviruses 77:323-325 
(1995); Chan, D. C, et al, Cell 59:263-273 (1997)). Results from both solution studies 
and crystallographic analysis indicate that in one form this structured region of the 
transmembrane protein is a trimer of two interacting regions of gp41. This trimeric 

20 structure is a six helix bundle consisting of an interior parallel coiled-coil trimer (region 
one) which associates with three identical a-helices (region two) which pack in an oblique, 
antiparallel manner into the hydrophobic grooves on the surface of the coiled-coil trimer 
(FIG. 3). This hydrophobic self-assembly domain is believed to constitute the core 
structure of gp41. 

25 A series of studies carried out using both synthetic peptides and recombinant 

proteins modeling the distal regions of the TM involved in generating this structure 
suggest that it (or the gp41 regions from which it is derived) plays a critical role in the 
process of HrV-1 entry (Wild, C, et aL, Proc. Natl. Acad. Sci. USA 59:10537-10541 
(1992); Wild, C, et aL, AIDS Res. Hum. Retroviruses 9:1051-1053 (1993);Wild, C, et 

30 aL, Proc. Natl. Acad. Sci. USA 97:12676-12680 (1994); Wild, C, etal. t AIDS Res. Hum. 
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Retroviruses 77:323-325 (1995); Wild, C, etal, Proc. Natl Acad. Sci. USA 97:9770- 
9774 (1994); Chen, C. -H., et aL t J. Virol. 69:3771-3777 (1995)). 

The functional role of the transmembrane protein of HIV- 1 in virus replication was 
shown when the region of the ectodomain of the TM corresponding to amino acid 

5 residues 558-595, which was predictive of a-helical secondary structure (Gallaher, W. R., 
et al, AIDS Res. Hum. Retroviruses 5:43 1-440 (1989); Delwart, E. L., et aL, AIDS Res. 
Hum. Retroviruses 6:703-704 (1990)), formed a coiled-coil structure when modeled as 
a synthetic peptide (Wild, C, etai, Proc. Natl Acad. Sci. USA 89: 10537-10541 (1992)). 
The peptide modeling this region, DP- 107, was shown to be a potent, virus specific 

10 inhibitor of HIV replication and the inhibitory activity was related to the structural 
components exhibited by the peptide. In both neutralization and cell-cell fusion assays, 
the DP- 107 peptide completely blocked virus infection at concentrations of 1.0 |ig/ml. 
Unlike other inhibitors of HIV replication (i.e. soluble CD4) and most neutralizing sera, 
the activity of the DP- 107 peptide was not isolate restricted. Using a series of DP- 107 

15 analogs containing structure disrupting point mutations and a set of HIV-1 envelope 
constructs containing identical mutations, it has been shown that the structural 
components of the coiled-coil region of the TM were critical to both virus entry and 
fusion phenotype and that mutations which disrupted this gp41 structure gave rise to an 
envelope complex which was unable to mediate virus entry (Wild, C, et al t Proc. Natl. 

20 Acad. Sci USA 97:12676-12680 (1994)). 

Studies of the coiled-coil domain of gp41 resulted in the identification of a second 
region of the ectodomain of the TM, which when modeled as a synthetic peptide, was also 
a potent, virus specific inhibitor of HIV replication (Wild, C, et al, AIDS Res. Hum. 
Retroviruses 9:1051-1053 (1993)). However, unlike the DP-107 region, the peptide 

25 corresponding to amino acid residues 643-678 of the TM (DP- 178), did not exhibit stable 
solution structure. Experiments with the DP-107 and DP-178 peptides established that 
both of these materials blocked HIV replication at an early step, most likely during virus 
entry (Wild, C, et al, Proc. Natl. Acad. Sci. USA 97:9770-9774 (1994)). This 
observation led to speculation that these peptides might inhibit virus replication by 

30 interacting with and disrupting determinants within the TM that were critical for virus 
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entry. Efforts to better define the higher order structural components that were present 
in gp41 and functioned during virus entry led to the observation that the distal regions of 
the TM modeled by the two inhibitory peptides (DP- 107 and DP-178) did interact with 
one another to form an oligomeric structure (Wild, C, et al, AIDS Res. Hum. 
5 Retroviruses 77:323-325 (1995); Chen, C. -H., et al, J. Virol. 693111-3111 (1995)). 
Recently, this oligomeric structure was characterized as a trimeric, six helix bundle 
consisting of an interior parallel coiled-coil trimer (DP- 107 region) which associates with 
three identical a-helices (DP- 1 78 region) which pack into the hydrophobic grooves on the 
surface of the coiled-coil trimer (Figure 3) (Chan, D. C, etal, Cell 89:263-273 (1997)). 

1 0 Research has focused on determining the functional role of these gp4 1 structural 

determinants in virus entry. DP- 107 and DP-178 peptides interact in a specific manner 
with the ectodomain of gp41 and this interaction is critical to their inhibitory activities. 

U.S. Patent No. 5,464,933, Bolognesi et a/., describes peptides which exhibit 
potent anti-retroviral activity. Specifically disclosed are the peptide DP-178 (SEQ ID 

15 NO:3) derived from the HIV-l^ gp41 protein, as well as fragments, analogs and 
homologs of DP- 178. The peptides are used as direct inhibitors of human and non-human 
retroviral transmission to uninfected cells. The patent teaches that the peptides may also 
be prophylactically employed in individuals after such individuals have had an acute 
exposure to HIV. 

20 U.S. Patent No. 5,656,410, Wild etal, describes protein fragments derived from 

the HIV transmembrane glycoprotein (gp41), including the peptide DP- 107 (SEQ. ID 
NO: 1) which have antiviral activity. Also disclosed are methods for inhibiting enveloped 
viral infection, and methods that modulate biochemical processes involving coiled coil 
peptide interactions. 

25 While recent work has increased knowledge of the structural components of the 

HIV-1 transmembrane protein, the immunogenic nature of gp41 remains poorly 
understood. It is known that one of two immunodominant regions present in the HTV-1 
envelope complex is located in gp41 (Xu, J. -Y., et al, J. Virol 65:4832-4838 (1991)). 
This determinant (TM residues 597-613) is associated with a strong, albeit 

30 non-neutralizing humoral response in a large number of HIV+ individuals. Also, the 
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broadly neutralizing antibody, 2F5, maps to the ectodomain of gp41 (TM residues 
662-667) (Muster, T., et al, J. Virol 67:6642-6647 (1993); Muster, T., et al, J. Virol 
65:4031-4034 (1994)). It is interesting to note that this antibody maps to a determinant 
of the TM that overlaps one of the two regions of gp41 which interact to form the 
5 recently characterized hydrophobic core of the protein (Figure 1). This observation has 
lead to speculation that 2F5 might actually neutralize virus by interacting with and 
disrupting the function of an entry- relevant gp41 structure. An extensive study which 
mapped the antigenic structure of gp41 supports this idea. This work characterized 
several conformation dependent gp4 1 M Abs which mapped to the same region of the TM 

10 as2F5(Earl,P.L.,<?ra/., /. Virol 77:2647-2684(1997)). Although the binding sites for 
these non-neutralizing monoclonal antibodies (MAbs) overlapped the 2F5 determinant, 
in competition experiments neither of these antibodies was blocked from binding to native 
protein by the 2F5 MAb. This indicates that while the two dimensional regions to which 
these antibodies map are similar, the three dimensional epitopes to which they bind are 

15 quite different 

The observation that only one neutralizing MAb (2F5) maps to the ectodomain of 
gp41 and that antibodies to the 2F5 epitope are poorly represented in sera from HIV 
infected individuals suggests that, for the most part, gp41 neutralizing epitopes are 
cryptic. The cryptic nature of these neutralizing epitopes is most likely related to the 

20 functional role of the TM in HIV-1 replication which involves mediating virus entry. It 
has been shown that prior to gpl20-CD4 binding the HIV envelope complex exists in a 
non-fusogenic form. While the exact nature of this pre-entry form is unknown, binding 
experiments have established that the non-fusogenic state is characterized by the 
inaccessibility of large portions of the gp41 ectodomain (Sattentau, Q. J. and J. P. Moore, 

25 J. Exp. Med. 774:407-415 (1991); Sattentau, Q. J., et al, Virol 206:713-717 (1995)). 
However, once binding of virus to target cell has occurred, the gpl20-gp41 complex 
undergoes a series of conformational changes that involve reorganization of both the 
extracellular surface component of the HIV-1 envelope protein (SU or gpl20) and TM 
proteins and the formation of structural components within the TM which are believed to 

30 be critical to virus entry. Although the steps involved in the transition from the 
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non-fusogenic to fusogenic state are largely unknown, it is believed that this 
transformation is characterized by the formation of a series of structural intermediates 
within the transmembrane protein which drive the conformational changes required for 
virus entry. The transitory nature of this event and the structures associated with it, rather 

5 than the absence of appropriate structural determinants, are believed to account for the 
poor neutralizing response to the TM component of the envelope system. 

Attention has been given to the development of vaccines for the treatment of HIV 
infection. The HIV-1 envelope proteins (gpl60, gpl20, gp41) have been shown to be the 
major antigens for anti-HIV antibodies present in ADDS patients (Barin, et al, Science 

10 228:1094-1096 (1985)). Thus far, these proteins seem to be the most promising 
candidates to act as antigens for anti-HIV vaccine development. To this end, several 
groups have begun to use various portions of gpl 60, gpl20, and/or gp4 1 as immunogenic 
targets for the host immune system. However, prior art attempts have thus far met with 
minimal success. 

1 5 Thus, although a great deal of effort is being directed to the design and testing of 

HIV vaccines, an effective vaccine is needed. 

Summary of the Invention 

An objective of the present invention is the induction and/or characterization of 
a humoral immune response targeting "entry-relevant" gp41 structures. In its broadest 

20 aspect, the present invention is directed to methods of raising a neutralizing antibody 
response to a broad spectrum of HIV strains and isolates. The present invention targets 
particular molecular conformations or structures that occur, or are exposed, following 
interaction of HIV with the cell surface during viral entry. Such a humoral response can 
be generated in vivo as a prophylactic or therapeutic measure in individuals to reduce or 

25 inhibit the ability of HIV to infect uninfected cells in the individual's body. Such a 
response can also be employed to raise antibodies against "entry relevant" gp4 1 structures. 
These antibodies can be subsequently employed for therapeutic uses, and as tools for 
further illuminating the mechanism of HIV cell entry. 
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One aspect of the present invention relates to a method of raising a broadly 
neutralizing antibody response to HIV by administering to a mammal a peptide or 
polypeptide comprising an amino acid sequence that is capable of forming a stable coiled- 
coil solution structure corresponding to or mimicking the heptad repeat region of gp41, 
(or the N-helical domain of gp41 ). Peptides of this aspect of the invention are exemplified 
by P-15 and P-17 described herein. 

A second aspect of the present invention relates to a method of raising a broadly 
neutralizing antibody response to HIV by administering to a mammal a peptide or 
polypeptide comprising an amino acid sequence that corresponds to, or mimics, the 
transmembrane-proximal amphipathic a-helical segment of gp41 (at the C-helical domain 
of gp4 1 ), or a portion thereof. Peptides of this aspect of the invention are exemplified by 
P-16 and P-18 described herein. 

A third aspect of the present invention relates to a method of raising a broadly 
neutralizing antibody response to HIV by administering to a mammal a composition 
including one or more peptides or polypeptides which comprise amino acid sequences that 
are capable of forming solution stable structures that correspond to, or mimic, the gp41 
core six helix bundle. This bundle forms in gp41 by the interaction of the distal regions 
(N-helical domain and C-helical domain) of the transmembrane protein. See FIG. 1. This 
aspect of the invention is also directed to novel mixtures of peptides and polypeptides, 
including multimeric and conjugate structures, wherein said mixtures and structures form 
a stable core helix solution structure. A preferred embodiment of this aspect of the 
invention involves raising antibodies to a physical mixture of N-helical domain peptide and 
C-helical domain peptide, for example, P-17 and P-18, P-15 and P-16, P-17 and P-16, or 
P-15 and P-18. 

The present invention is also directed to a method of raising a broadly neutralizing 
antibody response to HIV by administering to a mammal a composition including one or 
more novel peptides and proteins, herein referred to as conjugates, that mimic fusion- 
active transmembrane protein structures. These conjugates are formed from two or more 
amino acid sequences that comprise: 
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(a) one or more amino acid sequences that are capable of forming a stable coiled- 
coil solution structure corresponding to or mimicking the heptad repeat region of 
gp41 (N-helical domain); and 

(b) one or more amino acid sequences that correspond to, or mimic, an amino acid 
5 sequence of the transmembrane-proximal amphipathic a-helical segment of gp41 

(C-helical domain); 
wherein 

said one or more sequences (a) and (b) are alternately linked to one another via 
a bond, such as a peptide bond (amide linkage) or by an amino acid linking sequence 
10 consisting of about 2 to about 25 amino acids. These conjugates are preferably 
recombinantly produced. An example of such a conjugate is described in Example 5. 

In a preferred embodiment of this aspect of the invention, one or more of these 
conjugates folds and assembles in solution into a structure corresponding to, or 
mimicking, the gp41 core six helix bundle. 
15 The present invention also relates to methods for forming peptides, multimers and 

conjugates of the invention. 

The present invention also relates to pharmaceutical compositions comprising the 
peptides, multimers and conjugates of the invention and a pharmaceutical acceptable 
carrier. 

20 The present invention also relates to polyclonal and monoclonal antibodies that are 

raised to the peptides, multimers and conjugates described in the preceding paragraphs. 
The present invention also relates to a method of administering a composition 
comprising polyclonal or monoclonal antibodies described above to an individual in an 
amount effective to reduce HIV infection of uninfected cells. 

25 The present invention also relates to a vaccine for providing a protective response 

in an animal comprising one or more peptides, multimers or conjugates of the present 
invention together with a pharmaceutically acceptable diluent, carrier, or excipient, 
wherein the vaccine may be administered in an amount effective to elicit an immune 
response in an animal to HIV. In a preferred embodiment, the animal is a mammal. In 

30 another preferred embodiment, the mammal is a human. 
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Brief Description of the Figures 

FIG. 1 illustrates the structural and antigenic regions of HIV-1 gp4l. The 
extracellular, transmembrane and cytoplasmic domains are shown, as are the 
transmembrane-proximal amphipathic cc-helical segment of gp41 (C-helical domain) and 
the heptad repeat region of gp41 (N-helical domain). 

FIG. 2 illustrates the formation of multimeric peptide constructs corresponding to 
the heptad repeat region of gp41 (represented by P-17) and one or more suitable linker 
peptides. 

FIG. 3 illustrates the construction of conjugates of the invention derived from 
repeating gp 41 fragments; and their subsequent folding and interaction to form 
immunologically relevant epitopes. 

FIG. 4 depicts the analysis of polyclonal sera to various immunogens by surface 
immunoprecipitation. The precipitations were performed in the presence (+) or absence 
(-) of 10 ng/ml sCD4. 

FIG. 5 depicts analysis of polyclonal sera to various immunogens in neutralization 
assays. Immune sera or pre-immune (prebleed) sera were diluted 1 : 10 and incubated with 
various concentrations of virus (indicated in numbers of tissue culture infectious doses - 
TCID50). Levels of virus replication were measured by the amount of p24 in the 
supernatant seven days following infection, and normalized to the degree of replication 
in the absence of any rabbit serum. The positive (+ve) control used is a strongly 
neutralizing serum from an HIV-1 infected individual. 

FIG. 6: Percent neutralization for gp233 and gp234 sera in different experimental 
formats. FIG. 6a shows the titration of bleed 2 for each animal against HIV-1^ in the 
cell killing assay which uses cell viability as a measure of virus neutralization. MT2 cells 
are added to a mixture of virus (sufficient to result in greater than 80% cell death at 5 days 
post infection) and sera which had been allowed to incubate for approximately 1 hr. After 
5 days in culture, cell viability is measured by vital dye metabolism. FIG. 6b shows the 
percent neutralization for each bleed at a 1 : 1 0 dilution against HIV- 1 MN in an assay format 
employing GEM targets and p24 endpoint. In this assay, sera are incubated with 200 
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TCID 50 of virus for 1 hr prior to the addition of cells. On days 1, 3, and 5 media are 
changed. On day 7 culture supernatants are collected and analyzed for virus replication 
by p24 antigen levels. In each assay format, percent neutralization is determined by 
comparison of experimental wells with cell and cell/virus controls. 
5 FIG. 7 provides an example of a construct of the present invention (SEQ. ED 

NO:75) along with the corresponding nucleic acid sequence used for recombinant 
expression of the construct (SEQ. ID NO:76). 

Detailed Description of the Preferred Embodiments 

The transitory-nature of the HIV-entry event, and the structures associated with 

10 it, account for the seeming lack of neutralizing epitopes within gp41. These structural 
components, which form and function only during virus entry, and remain unexposed or 
are not present in the "native" fusion-inactive envelope complex, constitute a novel set of 
neutralizing epitopes within gp41. The present invention involves immunization with 
constructs mimicking these highly conserved, gp41 structures involved in virus entry to 

15 elicit the production of broadly neutralizing antibodies targeting these structures. Thus, 
this invention is the induction of a humoral immune response targeting these "entry 
relevant" gp41 structures. 

One aspect of the present invention relates to a method of raising a broadly 
neutralizing antibody response to HIV by administering to a mammal a peptide or 

20 polypeptide comprising an amino acid sequence that is capable of forming a stable coiled- 
coil solution structure corresponding to or mimicking the heptad repeat region of gp41 
which is located in the N-helical domain as defined herein. Peptides, or multimers thereof, 
that comprise amino acid sequences which correspond to or mimic solution conformation 
of the heptad repeat region of gp4 1 can be employed in this aspect of the invention. The 

25 heptad repeat region of gp4 1 includes 4 heptad repeats. Preferably, the peptides comprise 
about 28 to 55 amino acids of the heptad repeat region of the extracellular domain of HIV 
gp41 (N-helical domain, (SEQ. ID NO:l)), or multimers thereof. The peptides can be 
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administered as a small peptide, or conjugated to a larger carrier protein such as keyhole 
limpet hemocyanin (KLH), ovalbumin, bovine serum albumin (BSA) or tetanus toxoid. 

Alternatively, peptides forming a stable coiled-coil solution structure 
corresponding to or mimicking the heptad repeat region of gp4 1 can be employed to form 
polyclonal or monoclonal antibodies that can be subsequently administered as therapeutic 
or prophylactic agents. 

To determine whether a particular peptide or multimer will possess a stable 
trimeric coiled-coil solution structure corresponding to or mimicking the heptad repeat 
region of gp4 1 , the peptide can be tested according to the methods described in Wild, C, 
etaU Proc. Natl. Acad ScL USA 89: 10537-10541 (1992), fully incorporated by reference 
herein. 

Shown below is the sequence for residues of the HIV- 1^ gp4 1 protein that form 
the N-helical domain of the protein: 
ARQLLSGIVQQQNNIXRAE 

(SEQ. ID NO: 1) 

Two examples of useful peptides include the peptide P- 1 7, which has the formula, 
from amino terminus to carboxy terminus, of: 

NH 2 -NNLLRAEAQQHLLQLTVWGIKQLQARlLAVERYLKDQ-COOH 

(SEQ ID NO:2); 

and the peptide P-15, which has the formula, from amino terminus to carboxy terminus, 
of: 

NH 2 -SGWQQQNNLLRAEAQQHLLQLTVWGIKQLQARIL-COOH 

(SEQ ID NO:3). 

These peptides are optionally coupled to a larger carrier protein, or optionally include a 
terminal protecting group at the N- and/or C- termini. Useful peptides further include 
peptides corresponding to P- 17 or P-15 that include one or more, preferably 1 to 10 
conservative substitutions, as described below. A number of additional useful N-helical 
region peptides are described in the section entitled "Peptides." 

A second aspect of the present invention relates to a method of raising a broadly 
neutralizing antibody response to HIV by administering to a mammal a peptide or 
polypeptide comprising an amino acid sequence that corresponds to, or mimics, the 
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transmembrane-proximal amphipathic a-helical segment of gp41 (C-helical domain, (SEQ 
ID NO:4)), or a portion thereof. Useful peptides or polypeptides include an amino acid 
sequence that is capable of forming a core six helix bundle when mixed with a peptide 
corresponding to the heptad repeat region of gp41, such as the peptide P-17. Peptides 

5 can be tested for the ability to form a core six helix bundle employing the system and 
conditions described in Chan, D. C, et al t Cell 89:263-273 (1997); Lu, M., etal> Nature 
Struct, Biol 2: 1075-1082 (1995), fully incorporated by reference herein. 

Shown below is the amino acid sequence for residues of the HIV- 1 ^ gp4 1 protein 
that form the C-helical domain of the protein: 

10 WNNMTWMEWDREINNYTSLMS 

NITNW (SEQIDNO:4) 
Preferred peptides or multimers thereof, that can be employed in this aspect of the 
invention comprise about 6 or more amino acids, preferably about 24-56 amino acids, of 
the extracellular C-helical domain of HIV gp41. The peptides can be administered as a 

1 5 small peptide, or conjugated to a larger carrier protein such as keyhole limpet hemocyanin 
(KLH), ovalbumin, bovine serum albumin (BS A) or tetanus toxoid. This transmembrane- 
proximal amphipathic a-helical segment is exemplified by the peptides P-16 and P-18, 
described below. 

Alternatively, peptides or polypeptides comprising amino acid sequences that 
20 correspond to, or mimic, the transmembrane-proximal amphipathic a-helical segment of 
gp4 1 , or a portion thereof, can be employed to form polyclonal or monoclonal antibodies 
as therapeutic or prophylactic agents. 

Examples of useful peptides for this aspect of the invention include the peptide P- 
1 8 which corresponds to a portion of the transmembrane protein gp4 1 from the HIV- 1^, 
25 isolate, and has the 36 amino acid sequence (reading from amino to carboxy terminus): 
NH 2 -YTSLfflSLIEESQNQQEKNEQELLELDKWASLWNWF-COOH 

(SEQ ID NO:5); 

and the peptide P-16, which has the following amino acid sequence (reading from amino 
to carboxy terminus): 
30 NH r WMEWDREn^>rrrSLIHSLIEESQNQQEKNEQELL-COOH 

(SEQ ID NO:6) 
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These peptides are optionally coupled to a larger carrier protein. Useful peptides further 
include peptides corresponding to P- 18 or P- 16 that include one or more, preferably 1 to 
10 conservative substitutions, as described below. In addition to the full-length P- 18, 36- 
mer and the full length P-16, the peptides of this aspect of the invention may include 
5 truncations of the P-18 and P-16, as long as the truncations is capable of forming a six 
helix bundle when mixed with P- 17. A number of other useful peptides are described in 
the section entitled "Peptides," below. 

A third aspect of the present invention relates to a method of raising a broadly 
neutralizing antibody response to HIV by administering to a mammal a composition 

10 including one or more peptides or polypeptides which comprise amino acid sequences that 
are capable of forming solution stable structures that correspond to, or mimic, the gp41 
core six helix bundle. This bundle forms in gp41 by the interaction of the distal regions 
of the transmembrane protein, the heptad repeat region and the amphipathic a-helical 
region segment roughly corresponding to the N-helical domain and C-helical domain. See 

15 FIG. 1. The bundle structures that form in native virus are the result of a trimeric 
interaction between three copies each of the heptad repeat region and the transmembrane- 
proximal amphipathic a-helical segment. In the compositions of the present invention, 
peptide regions interact with one another to form a core six helix bundle. This aspect of 
the invention is also directed to novel mixtures of peptides and polypeptides, including 

20 multimeric and conjugate structures, wherein said structures form a stable core helix 
solution structure. 

This aspect of the invention can employ mixtures of (a) one or more peptides that 
comprise an amino acid sequence that corresponds to, or mimics, a stable coiled coil 
heptad repeat region of gp41; and (b) one or more peptides that comprise a region that 

25 corresponds to, or mimics, the transmembrane-proximal amphipathic a-helical segment 
of gp4 1 . These mixtures are optionally chemically or oxidatively cross-linked to provide 
additional immunogenic structures that may or may not be solution stable. In addition to 
physical mixtures, and conventional cross-linking, the peptides (a) and (b) can be 
conjugated together via suitable linking groups, preferably a peptide residue having at 

30 least 2, preferably 2 to 25, amino acid residues. Preferred linking groups are formed from 
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combinations of glycine and serine, or combinations of glycine and cysteine when further 
oxidative cross-linking is envisioned. 

A preferred embodiment of this aspect of the invention involves raising antibodies 
to physical mixtures of P-17 and P-18, P-15 and P-16, P-17 and P-16 or P-15 and P-18. 

The present invention is also directed to a method of raising a broadly neutralizing 
antibody response to HIV by administering to a mammal a composition including one or 
more novel peptides and proteins, herein referred to as conjugates, that mimic fusion- 
active transmembrane protein structures. These conjugates are formed from peptides and 
proteins that comprise: 

(a) one or more amino acid sequences of 28 or more amino acids that are capable 
of forming a stable coiled-coil solution structure corresponding to or mimicking 
the heptad repeat region of gp41; and 

(b) one or more amino acid sequences that correspond to, or mimic, an amino acid 
sequence of the transmembrane-proximal amphipathic a-helical segment of gp4 1 ; 

wherein 

said one or more sequences (a) and (b) are alternately linked to one another via 
a peptide bond (amide linkage) or by an amino acid linking sequence consisting of about 
2 to about 25 amino acids. These peptides and proteins are preferably recombinantly 
produced. 

In a preferred embodiment of this aspect of the invention, one or more of these 
conjugates folds and assembles into a structure corresponding to, or mimicking, the gp41 
core six helix bundle. 

Non-limiting examples of the novel constructs or conjugates that can be formed 
include: 

(1) three tandem repeating units consisting of P-17-linker-P-18 
(P-17-linker-P-18-linker-P-17-linker-P-184inker-P-17-linker-P-18), 

(2) P-17-linker-P-18-linker-P-17, 

(3) P- 1 8-linker-P- 1 7-linker-P- 1 8, 

(4) P-17-linker-P-17, 
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(5) three tandem repeating units consisting of P-15-linker-P-16 
(P- 1 5-linker-P- 1 6-linker-P- 1 5-linker-P- 1 6-linker-P- 1 5-linker-P- 16), 

(6) P- 15-linker-P-l 6-linker-P- 1 5, 

(7) P-16-linker-P-15-linker-P-16, and 
5 (8) P-16-linker-P-15; 

wherein each linker is an amino acid sequence, which may be the same or different, of 
from about 2 to about 25, preferably 2 to about 16 amino acid residues. Preferred amino 
acid residues include glycine and serine, for example (GGGGS) X , (SEQ ID NO:7) wherein 
x is 1, 2, 3, 4, or 5, or glycine and cysteine, for example (GGC)y, where y is 1, 2, 3 4 or 

10 5. In any of the described constructs, P-15 and P-17 are interchangeable and P-16 and 
P- 18 are interchangeable. An example of such a construct (SEQ ID NO:77) is shown in 
FIG. 7, along with the corresponding nucleic acid sequence (SEQ ID NO:78) used for 
recombinant expression of the construct. 

Alternatively, polyclonal or monoclonal antibodies can be raised against the 

15 immunogenic mixtures and conjugates described in this aspect of the invention. Such 
antibodies can be employed as therapeutic or prophylactic agents. 

In preferred aspects of the invention, the methods can be employed to immunize 
an HIV- 1 -infected individual such that levels of HIV- 1 will be reduced in such individual. 
In another aspect, the methods can be employed to immunize a non-HIV-1 -infected 

20 individual so that, following a subsequent exposure to HIV- 1 that would normally result 
in HIV-1 infection, the levels of HIV-1 will be non-detectable using current diagnostic 
tests. 

Immunogen Preparation 

Induction and interpretation of a humoral immune response directed against gp41 
25 structural epitopes requires both immunogen preparation and antibody characterization. 
Synthetic peptides and recombinant proteins can both be used to generate antigenic 
structures corresponding to gp41 fusion active domains. 
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In one aspect of the invention, target immunogens model the heptad repeat region 
delineated by the P-17 peptide (capable of forming a trimeric coiled-coil structure). In 
another aspect of the invention, target immunogens model the transmembrane-proximal 
. amphipathic a-helical segment delineated by the P- 1 8 peptide. This region in the absence 
5 of the coiled-coil core exhibits random coil solution structure. (Wild, C, etai, Proc. Natl. 
Acad. Sci. USA 59:10537-10541 (1992); Wild, C, et al., AIDS Res. Hum. Retroviruses 
9:1051-1053 (1993); Wild, C, et al, Proc. Natl. Acad. ScL USA 9/:9770-9774 (1994)). 
In another aspect, combinations of these target immunogens are employed for raising 
antibodies. 

10 In another aspect of the invention the target immunogen is the six helix 

hydrophobic bundle. This bundle is formed by the specific association of these two distal 
regions of the ectodomain of gp4 1 (Chan, D. C, et al t Cell £9:263-273 (1997); Lu, M., 
et al, Nature Struct. Biol. 2:1075-1082 (1995)). These constructs will mimic entry 
determinants which form and function during HIV-l entry. 

1 5 Synthetic Methods of Immunogen Preparation 

Immunogens can be prepared by several different routes. The constructs can be 
generated from synthetic peptides. This involves preparing each sequence as a peptide 
monomer followed by post-synthetic modifications to generate the appropriate oligomeric 
structures. The peptides are synthesized by standard solid-phase methodology. To 

20 generate a trimeric coiled-coil structure, the P-17 peptide monomer is solubilized under 
conditions which favor oligomerization. These conditions include a 20 mM phosphate 
buffer, pH 4.5 and a peptide concentration of 100 jiM (Wild, C, et al, Proc. Natl Acad. 
ScL t/SAS9: 10537- 10541 (1992)). The structure which forms under these conditions can 
be optionally stabilized by chemical crosslinking, for example using gluteraldehyde. 

25 Alternatively, a protocol which makes use of intermolecular disulfide bond 

formation to stabilize the trimeric coiled-coil structure can be employed in order to avoid 
any disruptive effect the cross-linking process might have on the structural components 
of this construct. This approach uses the oxidation of appropriately positioned cysteine 
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residues within the peptide sequence to stabilize the oligomeric structure. This requires 
the addition of a short linker sequence to the N terminus of the P- 1 7 peptide. The trimeric 
coiled-coil structure which is formed by this approach will be stabilized by the interaction 
of the cysteine residues (FIG. 2). The trimer is separated from higher order oligomeric 

5 forms, as well as residual monomer, by size exclusion chromatography and characterized 
by analytical ultracentrifugation. These covalently stabilized coiled-coil oligomers serve 
as the core structure for preparation of a six helix bundle. 

To accomplish preparation of a six helix bundle, an excess of P-18 peptide is 
added to the purified core structure. After incubation the reaction mixture is subjected 

10 to a cross-linking procedure to stabilize the higher order products of the specific 
association of these two peptides. The desired material is isolated by size exclusion 
chromatography and characterized by analytical ultracentrifugation. The immunogen 
corresponding only to the P- 1 8 peptide requires no specific post-synthetic modifications. 
Using this approach, three separate target constructs are generated rapidly and in large 

15 amounts. 

Recombinant Methods of Immunogen Preparation 

Another method for preparing target immunogens involves the use of a bacterial 
expression vector to generate recombinant gp41 fragments. The use of an expression 
vector to produce the peptides and polypeptides capable of forming the entry-relevant 

20 immunogens of the present invention adds a level of versatility to immunogen preparation. 

New and modified forms of the antigenic targets are contemplated as the structural 
determinants of HIV- 1 entry are better understood. The recombinant approach readily 
accommodates these changes. Also, this method of preparation allows for the ready 
modification of the various constructs (i.e. the addition of T- or B-cell epitopes to the 

25 recombinant gp4 1 fragments to increase immunogenicity). In addition, a form of the six 
helix hydrophobic core structure is generated , which will not require additional 
stabilization, since determining the antigenic nature of this structure is important. Finally, 
these recombinant constructs can be employed as a tool to provide valuable insights into 
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additional structural components which form and function in gp41 during the process of 
virus entry. 

Thus, as part of the invention, novel fusion polypeptides (conjugates) are also 
provided, as are vectors, host cells and recombinant methods for producing the same. The 
present invention provides isolated nucleic acid molecules comprising a polynucleotide 
encoding the conjugates of the invention. 

The present invention also relates to recombinant vectors, which include the 
isolated nucleic acid molecules of the present invention, and to host cells containing the 
recombinant vectors, as well as to methods of making such vectors and host cells and for 
using them for production of fusion polypeptides or peptides by recombinant techniques. 

The polynucleotides may be joined to a vector containing a selectable marker for 
propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such as 
a calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is a 
virus, it may be packaged in vitro using an appropriate packaging cell line and then 
transduced into host cells. 

The DN A insert should be operatively linked to an appropriate promoter, such as 
that described herein. Other suitable promoters will be known to the skilled artisan. The 
expression constructs will further contain sites for transcription initiation, termination and, 
in the transcribed region, a ribosome binding site for translation. The coding portion of 
the mature transcripts expressed by the constructs will preferably include a translation 
initiating at the beginning and a termination codon (UAA, UGA or UAG) appropriately 
positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one selectable 
marker. Such markers include dihydrofolate reductase or neomycin resistance for 
eukaryotic cell culture and tetracycline or ampicillin resistance genes for culturing in E. 
coli and other bacteria. Representative examples of appropriate hosts include, but are not 
limited to, bacterial cells, such as E. coli, Streptomyces and Salmonella typhimurium cells; 
fungal cells, such as yeast cells; insect cells such as Drosophila S2 and Spodoptera Sf9 
cells; animal cells such as CHO, COS and Bowes melanoma cells; and plant cells. 
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Appropriate culture mediums and conditions for the above-described host cells are known 
in the art. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 

5 transfection, electroporation, transduction, infection or other methods. Such methods are 
described in many standard laboratory manuals, such as Davis et aL, Basic Methods In 
Molecular Biology (1986). 

The polypeptide may be expressed in a modified form, such as a fusion protein, 
and may include not only secretion signals, but also additional heterologous functional 

10 regions. For instance, a region of additional amino acids, particularly charged amino 
acids, may be added to the N-terminus of the polypeptide to improve stability and 
persistence in the host cell, during purification, or during subsequent handling and storage. 
Also, peptide moieties may be added to the polypeptide to facilitate purification. Such 
regions may be removed prior to final preparation of the polypeptide. The addition of 

1 5 peptide moieties to polypeptides to engender secretion or excretion, to improve stability 
and to facilitate purification, among others, are familiar and routine techniques in the art. 

The fusion protein can be recovered and purified from recombinant cell cultures 
by well-known methods including ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose chromatography, 

20 hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite 
chromatography and lectin chromatography. Most preferably, high performance liquid 
chromatography ("HPLC") is employed for purification. Depending upon the host 
employed in a recombinant production procedure, the polypeptides of the present 
invention may be glycosylated or may be non-glycosylated. In addition, polypeptides of 

25 the invention may also include an initial modified methionine residue, in some cases as a 
result of host-mediated processes. 

A bacterial expression vector (kindly provided by Dr. Terrance Oas, Duke 
University) was developed specifically for the expression of small proteins. This plasmid, 
pTCLE-G2C, is based on pAED-4, a T7 expression vector. A modified TrpLE (Yansura, 

30 D. G., Methods Enzymol. 755:161-166 (1990)) fusion peptide (provided by Dr. Peter 
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Kim) was inserted after the T7 promoter (Studier, F. W., et aL, Methods EnzymoL 
785:60-89 (1990)). There is an in frame Nde I site at the end of the TrpLE peptide that 
encodes a methionine cyanogen bromide (CNBr) cleavage site. This vector was used in 
an earlier study to express a recombinant form of the P-17 peptide (Calderone, T. L M et 
5 aL, J. MoL Biol. 262:401 All (1996)) and has been modified to expresses the P-18 
peptide. 

To generate a six helix hydrophobic core structure, several combinations of the 
heptad repeat (for example, P-17 or P-15) region and the amphipathic a-helical (for 
example, P- 1 6 or P- 1 8) segment of gp4 1 are separated by a flexible linker of amino acid 
10 residues. For example, (GGGGS) 3 (SEQ ID NO:7) can be encoded into the vector. This 
is accomplished by standard PCR methods. The (GGGGS) 3 (SEQ ID NO:7) linker motif 
is encoded by a synthetic oligonucleotide which is ligated between the P-17 and P-18 
encoding regions of the expression vector. 

All constructions are characterized by multiple restriction enzyme digests and 
1 5 sequencing. The success of this approach to attain multicomponent interactions has been 
recently demonstrated (Huang, B., et aL, J. Immunol. 758:216-225 (1997)). 

Examples of the novel constructs or conjugates that can be formed by the method 
are described above. 

Based on the parallel orientation of the subunits of the coiled coil core and the 
20 antiparallel orientation of the amphipathic a-helical segment in the six helix bundle, these 
constructs fold to generate the desired structures (See, FIG. 3.). Following expression, 
the recombinant gp4 1 fragments are isolated as inclusion bodies, cleaved from the leader 
sequence by cyanogen bromide, and separated from the leader by-product by size 
exclusion chromatography step (SUPERDEX 75). This protocol has been successfully 
25 used in the purification of large quantities of a modified form of the P-17 peptide 
(Calderone, T. L., et aL, J. MoL BioL 262:407-412 (1996)). Recombinant constructs (2) 
and (3) are mixed in equalmolar quantities under non-denaturing conditions to generate 
' a six -helix hydrophobic core structure. Constructs (1) and (4) will fold either intra- or 
intermolecularly to generate the same or similar structures (see FIG. 3 for the folding 
30 process). The desired product is purified by size exclusion chromatography on a 
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SUPERDEX 75 FPLC column and characterized by molecular weight under using a 
Beckman Model XL-A analytical ultracentrifuge. 

Definitions 

The phrase "entry-relevant" as employed herein, refers to particular molecular 
5 conformations or structures that occur or are exposed following interaction of HIV with 
the cell surface during viral entry, and the role of particular amino acid sequences and 
molecular conformations or structures in viral entry. 

The term "neutralizing" as employed herein refers to the ability to inhibit entry of 
HIV into cells, including an amount of inhibition that is useful for reducing or preventing 
10 infection of uninfected cells by the virus. 

The term "HIV" as used herein refers to all strains and isolates of human 
immunodeficiency virus type 1 . The constructs of the invention were based upon HIV- 1 
gp41, and the numbering of amino acids in HIV proteins and fragments thereof given 
herein is with respect to the HIV-l,^ isolate. However, it is to be understood, that while 
1 5 HIV- 1 viral infection and the effects of the present invention on such HIV- 1 infection are 
being used herein as a model system, the entry mechanism that is being targeted is relevant 
to all strains and isolates of HIV-1. Hence the invention is directed to "broadly 
neutralizing" methods. 

The phrase "heptad repeat" or "heptad repeat region" as employed herein, refers 
20 to a common protein motif having a 4-3 repeat of amino acids, commonly leucine and/or 
isoleucine, and is often associated with alpha-helical secondary structure. The 'heptad 
repeat" can be represented by the following sequence: 

-A Aj - A A 2 ~ AA^- A A4-A A^- AA^- AAy- 

where AA, and AA 4 are each one of leucine or isoleucine; while AA 2 , AA 3 , AA 5 , AA 6 , and 
25 AA 7 can be any amino acid. See, Wild, C, et al, Proc. Natl Acad. Sci. USA 89: 10537- 
10541 (1992). 
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Peptides are defined herein as organic compounds comprising two or more amino 
acids covalently joined by peptide bonds. Peptides may be referred to with respect to the 
number of constituent amino acids, i.e., a dipeptide contains two amino acid residues, a 
tripeptide contains three, etc. Peptides containing ten or fewer amino acids may be 
referred to as oligopeptides, while those with more than ten amino acid residues are 
polypeptides. 

Peptides 

The complete gp41 amino acid sequence (HTV-1 Group M: Subtype B Isolate: 
LAI, N to C termini) is: 

AVGIGALFLGFIXjAAGSTMGARSMTLTVQARQLL^GIVQQQNNLLRAIEA 
QQHLLQLTVWGIKQLQARILAVERYLKDQQLLGIWGGSGKLICTTAVPWNAS 
WSNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQQEK 
NEQELLELDKWASLWNWFNITNWLW 

NRVRQGYSPLSFQTHLP-TPRG-PDRPEGIEEEGGERDRDRSIRLVNGSL 

ALIWDDLRSLCLFSYHRLRDLLLIVTRIVELLGRRGWEALKYWW 

NLLQYWSQELKNSAVSLLNATAIAVAEGTDRVIEVVQGACRAIRHIPRRIRQG 

LERILL. (SEQ ID NO:8) 

N-terminal helix region: 

ARQLLSGWQQQNNIXRAE^^ 

(SEQ ID NO: 1) 

Shown below is the sequence for residues 558-595 (SEQ ID NO:7) of the 
HIV-luu gp4l protein in the N-helical domain of the protein. The a and d subscripts 
denote the 4-3 positions of the heptad repeat. 

nnllraxeaqqhllqltvwgikqlqarilaverylkdq 

d ad ad ad ad a 

571 578 585 (SEQ ID NO: 2) 
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C-terminal helix region: 

WNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 
NITNW (SEQIDNO:4) 



Shown below is the amino acid sequence for residues 643-678 of the HIV-l^, 
gp41 protein in the C-helical domain of the protein. 

YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 
dadad adad a 

647 654 661 (SEQ ID NO:5) 

Unlike the N-helix, when modeled as a peptide, the C-helical region of gp4 1 is not 
structured. However, when mixed with the N-peptide, the C-peptide does takes on 
a-helical structure as part of the core structure complex. The structure forms in vitro on 
mixing the peptides and can be characterized spectrophotometrically (Lu, ML, et al , Nat. 
Struct Biol 2: 1075- 1 082 ( 1 995)). The initial determination of the effect of the mutations 
on C-helix structure may be performed by analyzing the ability of the mutant C-peptide 
to interact with the N-peptide and form the six -helix bundle. This analysis may be carried 
out using circular dichroism. N-helical and C-helical domain peptides can be constructed 
from multiple strains of HIV, and can include deletions, insertions and substitutions that 
do not destroy the ability of the resulting peptide to elicit antibodies when employed alone 
or in combination with other peptides of the invention. 

Examples of N-helical Domain Peptide Sequences (All sequences are listed from 
N-terminus to C-terminus.) from different HIV strains include, but are not limited to the 
following peptides: 

HIV-1 Group M: Subtype B Isolate: LAI 
ARQIXSGTVQQQNNLIitAIEAQ 

(SEQ ID NO: 1) 
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SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ 

(SEQ ID NO:9) 

P15 SGIVQQQNNLLRA1EAQQHLLQLTVWGIKQLQARIL (SEQIDNO:3) 
P-17 NNLLRAffiAQQHLLQLTVWGIKQLQARILAVERYLKDQ 

(SEQIDNO:2) 

Subtype B Isolate: ADA 

SGIVQQQNNLLRAEEAQQHLLQLTVWGIKQLQARVLALERYLRDQ 

(SEQ ID NO: 10) 

SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARVL (SEQ ID NO: 11) 

NNLLRAEAQQHLLQLT VWGIKQLQAR VLALER YLRDQ (SEQ ID NO: 12) 

Subtype B Isolate: JRFL 

SGIVQQQNNLLRAIEAQQRMLQLTVWGIKQLQARVLAVERYLGDQ 

(SEQ ID NO: 13) 

SGIVQQQNNLLRAIEAQQRMLQLTVWGIKQLQARVL (SEQ ID NO: 14) 

NNLLRAIEAQQRMLQLTVWGIKQLQARVLAVERYLGDQ (SEQ ID NO: 15) 

Subtype B Isolate: 89.6 

SGIVQQQNNLLRAIEAQQHMLQLTVWGIKQLQAR VLALER YLRDQ 

(SEQ ID NO: 16) 

SGIVQQQNNLLRAIEAQQHMLQLTVWGIKQLQARVL (SEQ ID NO: 17) 

NNLLRAEAQQHMLQLTVWGIKQLQARVLALER YLRDQ (SEQ ID NO: 18) 

Subtype C Isolate: BU910812 

SGIVQQQSNLLRAEEAQQHMLQLTVWGIKQLQARVLAIER YLRDQ 

(SEQ ID NO: 19) 

' SGIVQQQSNLLRAIEAQQHMLQLTVWGIKQLQARVL (SEQ ID NO:20) 

SNLLRAIEAQQHMLQLTVWGIKQLQARVLAIER YLRDQ (SEQ ID NO:21) 
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Subtype D Isolate: 92UG024D 

SGTVQQQNNLLRAEAQQHLLQLTVWGIKQLQARVLAVESYLKDQ 

(SEQ ID NO:22) 

SGIVQQQNNLLR AIE AQQHLLQLT VWGIKQLQARVL (SEQ ID NO: 1 1 ) 

NNLLRAffiAQQHLLQLTVWGIKQLQARVLAVESYLKDQ (SEQIDNO:23) 

Subtype F Isolate: BZ163A 

SGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQARVLAVERYLQDQ 

(SEQ ID NO:24) 

SGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQARVL (SEQ ID NO:25) 

SNLLRAIEAQQHLLQLTVWGIKQLQARVLAVERYLQDQ (SEQ ID NO:26) 

Subtype G Isolate: FI.HH8793 

SGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQARVLALERYLRDQ 

(SEQ ID NO:27) 

SGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQARVL (SEQ ID NO:25) 

SNLLRAIEAQQHLLQLTVWGIKQLQARVLALERYLRDQ (SEQ ID NO:28) 

Subtype H Isolate: BE.VI997 

SGIVQQQSNLLRAIQAQQHMLQLTVWGVKQLQARVLAVERYLKDQ 

(SEQ ID NO:29) 

SGIVQQQSNLLRAIQAQQHMLQLTVWGVKQLQARVL (SEQ ID NO:30) 

SNLLRAIQAQQHMLQLTVWGVKQLQARVLAVERYLKDQ (SEQ ID NO:3 1) 

Subtype J Isolate: SE.SE92809 

SGIVQQQSNLLKAIEAQQHLLKLTVWGIKQLQARVLAVERYLKDQ 

(SEQ ID NO:32) 

SGIVQQQSNLLKAIEAQQHLLKLTVWGIKQLQARVL (SEQ ID NO:33) 

SNLLKAIEAQQHLLKLTVWGIKQLQARVLAVERYLKDQ (SEQ ID NO:34) 
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Group N Isolate: CM.YBF30 

SGrVQQQNILLRAffiAQQHLLQLSIWGIKQLQAKVLAlERYLRDQ 

(SEQE)NO:35) 

SGIVQQQNILLRAEAQQHLLQLSIWGIKQLQAKVL (SEQ ID NO:36) 

NILLRAEAQQHLLQLSIWGIKQLQAKVLAIERYLRDQ (SEQ ID NO:37) 

Group O Isolate: CM.ANT70C 

KGIVQQQDNLLRAIQAQQQLLRLSxWGIRQLRARLLALETLLQNQ 

(SEQ ID NO:38) 

KGIVQQQDNLLRAIQAQQQLLRLSxWGIRQLRARL (SEQ ID NO:39) 

DNLLRAIQAQQQLLRLSxWGIRQLRARLLALETLLQNQ (SEQ ID NO:40) 

Examples of C-helical Domain Peptide Sequences (All sequences are listed from 
N-terminus to C-terminus.) from different HIV strains include, but are not limited to the 
following peptides: 

HIV-1 Group M: Subtype B Isolate: LAI 

WNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 
NITNW (SEQE>NO:4) 
WMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 

(SEQ ID NO:41) 

P16 WMEWDREINNYTSLIHSLIEESQNQQEKNEQELL (SEQ ID NO:6) 

P- 1 8 YTSLfflSLEESQNQQEKNEQELLELDKWASLWNWF 

(SEQ ID NO:5) 

Subtype B Isolate: ADA 

WMEWEREIENYTGLIYTLIEESQNQQEKNEQDLLALDKWASLWNWF 

(SEQ ID NO:42) 

WMEWEREENYTGLIYTLIEESQNQQEKNEQDLL (SEQ ID NO:43) 

YTGLIYTLIEESQNQQEKNEQDLLALDKWASLWNWF (SEQ ID NO:44) 
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Subtype B Isolate: JRFL 

WMEWEREDDNYTSEIYTLIEESQNQQEKNEQELLELDKWASLWNWF 

(SEQ ID NO:45) 

WMEWEREE)N YTSEIYTLIEES QNQQEKNEQELL (SEQ ID NO:46) 

YTSEIYTLIEESQNQQEKNEQELLELDKWASLWNWF (SEQ ID NO:47) 

Subtype B Isolate: 89.6 

WMEWEREIDNYTDYIYDLLEKSQTQQEKNEKELLELDKWASLWNWF 

(SEQ ID NO:48) 

WMEWEREIDNYTDYIYDLLEKSQTQQEKNEKELL (SEQIDNO:49) 
YTDYIYDLLEKSQTQQEKNEKELLELDKWASLWNWF , (SEQ ID NO:50) 

Subtype C Isolate: BU910812 

WIQWDREISNYTGHYRLLEESQNQQENNEKDLLALDKWQNLWSWF 

(SEQIDNO:51) 

WIQWDREISNYTGHYRLLEESQNQQENNEKDLL (SEQ ID NO:52) 

YTGIIYRLLEESQNQQENNEKDLLALDKWQNLWSWF (SEQ ID NO:53) 

Subtype D Isolate: 92UG024D 

WMEWEREISNYTGLIYDLIEESQIQQEKNEKDLLELDKWASLWNWF 

(SEQ ID NO:54) 

WMEWEREISNYTGLIYDLIEESQIQQEKNEKDLL (SEQ ID NO:55) 

YTGLIYDLIEESQIQQEKNEKDLLELDKWASLWNWF (SEQ ID NO:56) 

Subtype F Isolate: BZ163A 

WMEWQKEISNYSNEVYRLEKSQNQQEKNEQGLLALDKWASLWNWF 

(SEQ ID NO:57) 

WMEWQKEISNYSNEVYRLEEKSQNQQEKNEQGLL (SEQ ID,NO:58) 

YSNEVYRLIEKSQNQQEKNEQGLLALDKWASLWNWF (SEQ ID NO:59) 
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Subtype G Isolate: FI.HH8793 

WIQWDREISNYTQQIYSLBEESQNQQEKNEQDLLALDNWASLWTWF 

(SEQ ID NO:60) 

WIQWDREISNYTQQIYSLDEESQNQQEKNEQDLL (SEQ ED NO:61) 

YTQQIYSLEEESQNQQEKNEQDLLALDNWASLWTWF (SEQ ID NO:62) 

Subtype H Isolate: BE.VI997 

WMEWDRQIDNYTEV1YRLLELSQTQQEQNEQDLLALDKWDSLWNWF 

(SEQ ID NO:63) 

WMEWDRQIDN YTEVIYRLLELS QTQQEQNEQDLL (SEQ ID NO:64) 

YTEVIYRLLELSQTQQEQNEQDLLALDKWDSLWNWF (SEQ ED NO:65) 

Subtype J Isolate: SE.SE92809 

WIQWEREENNYTGIEYSLEEEAQNQQENNEKDLLALDKWTNLWNWFN 

(SEQ ED NO:66) 

WIQWEREINNYTGHYSLEBEAQNQQENNEKDLL (SEQ ED NO:67) 

YTGETYSLEEEAQNQQENNEKDLLALDKWTNLWNWFN (SEQ ED NO:68) 

Group N Isolate: CM.YBF30 

WQQWDEKVRNYSGVEFGLEEQAQEQQNTNEKSLLELDQWDSLWSWF 

(SEQ ED NO:69) 

WQQWDEKVRNYSGVIFGLIEQAQEQQNTNEKSLL (SEQ ED NO:70) 

YSGVIFGLIEQAQEQQNTNEKSLLELDQWDSLWSWF (SEQ ID NO:71) 

Group O Isolate: CM.ANT70C 

WQEWDRQISNISSTEYEEIQKAQVQQEQNEKKLLELDEWASIWNWL 

(SEQ ID NO:72) 

WQEWDRQIS NIS STEYEEIQKAQ VQQEQNEKKLL (SEQ ID NO:73) 

ISSTIYEEIQKAQVQQEQNEKKLLELDEWASIWNWL (SEQ ID NO:74) 



WO 00/40616 



-29- 



PCT/US00/00456 



The peptides and conjugates of the present invention may be acylated at the NH 2 
terminus, and may be amidated at the COOH terminus. 

The peptides and conjugates of the invention may include conservative amino acid 
substitutions. Conserved amino acid substitutions consist of replacing one or more amino 
acids of the peptide sequence with amino acids of similar charge, size, and/or 
hydrophobicity characteristics, such as, for example, a glutamic acid (E) to aspartic acid 
(D) amino acid substitution. When only conserved substitutions are made, the resulting 
peptide is functionally equivalent to the peptide from which it is derived. 

Peptide sequences defined herein are represented by one-letter symbols for amino 
acid residues as follows: 



A 


alanine 


L 


leucine 


R 


arginine 


K 


lysine 


N 


asparagine 


M 


methionine 


D 


aspartic acid 


F 


phenylalanine 


C 


cysteine 


P 


proline 


Q 


glutamine 


S 


serine 


E 


glutamic acid 


T 


threonine 


G 


glycine 


W 


tryptophan 


H 


histidine 


Y 


tyrosine 


I 


isoleucine 


V 


valine 



The peptides and conjugates of the invention may include amino acid insertions 
which consist of single amino acid residues or stretches of residues ranging from 2 to 15 
amino acids in length. One or more insertions may be introduced into the peptide, peptide 
fragment, analog and/or homolog. 

The peptides and conjugates of the invention may include amino acid deletions of 
the full length peptide, analog, and/or homolog. Such deletions consist of the removal of 
one or more amino acids from the full-length peptide sequence, with the lower limit length 
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of the resulting peptide sequence being 4 to 6 amino acids. Such deletions may involve 
a single contiguous portion or greater than one discrete portion of the peptide sequences. 

The peptides of the invention may be synthesized or prepared by techniques well 
known in the art. See, for example, Creighton, Proteins: Structures and Molecular 
Principles, W.H. Freeman & Co., New York, NY (1983), which is incorporated herein 
by reference in its entirety. Short peptides, for example, can be synthesized as a solid 
support or in solution. Longer peptides may be made using recombinant DN A techniques. 
Here, the nucleotide sequences encoding the peptides of the invention may be synthesized, 
and/or cloned, and expressed according to techniques well known to those of ordinary 
skill in the art. See, for example, Sambrook, et a/., Molecular Cloning, A Laboratory 
Manual, Vols. 1-3, Cold Spring Harbor Press, Cold Spring Harbor, NY (1989). 

In yet another embodiment of the invention, peptides comprising the sequences 
described above may be synthesized with additional chemical groups present at their 
amino and/or carboxy termini, such that, for example, the stability, bioavailability, and/or 
immunogenic activity of the peptides is enhanced. For example, hydrophobic groups such 
as carbobenzoxy, dansyl, or t-butyloxycarbonyl groups, may be added to the peptides' 
amino termini. Likewise, an acetyl group or a 9-fluorenylmethoxy-carbonyl group may 
be placed at the peptides' amino termini. Additionally, the hydrophobic group 
t-butyloxycarbonyl, or an amido group may be added to the peptides' carboxy termini. 
In one preferred embodiment, carrier proteins, such as keyhole limpet hemocyanin, 
ovalbumin, BS A or tetanus toxoid are added to the peptide. 

With reference to the peptides P-17 and P-18, deletion mutants are further 
described. 

The peptide P-18 corresponds to amino acid residues 638 to 673 of the 
transmembrane protein gp41 from the HIV-l^, isolate: 

In addition to the full-length C-helical peptides identified above, useful peptides 
of the invention may include truncations of the C-helical peptides (SEQ ID NO:4) which 
exhibit the ability to raise neutralizing antibodies or form a six-helix hydrophobic core 
structure under conditions described herein. Such truncated peptides may comprise 
peptides of between 3 and 56 amino acid residues, i.e., peptides ranging in size from a 
tripeptide to a 56-mer polypeptide. As an example, such peptides are listed for P-18 in 
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Tables I and II, below. Peptide sequences in these tables are listed from amino (left) to 
carboxy (right) terminus. "X" may represent an amino group (-NH 2 ) and "Z" may 
represent a carboxyl (-COOH) group. Alternatively, as described below, "X" and/or "Z" 
may represent a hydrophobic group, an acetyl group, a FMOC group, an amido group, 
or a covalently attached macromolecule. 

TABLE I 

Carboxy Truncations of SEQ ID NO:5 

X-YTS-Z 

X-YTSL-Z 

X-YTSLI-Z 

X-YTSLIH-Z 

X-YTSLIHS-Z 

X-YTSLIHSL-Z 

X-YTSLIHSLI-Z 

X-YTSLIHSLIE-Z 

X-YTSLIHSLIEE-Z 

X-YTSLIHSLIEES-Z 

X-YTSLIHSLIEESQ-Z 

X-YTSLIHSLIEESQN-Z 

X-YTSLIHSLIEESQNQ-Z 

X-YTSLIHSLIEESQNQQ-Z 

X-YTSLIHSLIEESQNQQE-Z 

X- YTSLIHSLIEESQNQQEK- Z 

X-YTSLIHSLIEESQNQQEKN-Z 

X-YTSLIHSLIEESQNQQEKNE-Z 

X-YTSLIHSLIEESQNQQEKNEQ-Z 

X-YTSLIHSLIEESQNQQEKNEQE-Z 

X-YTSLIHSLIEESQNQQEKNEQEL-Z 

X-YTSLIHSLIEESQNQQEKNEQELL-Z 

X- YTSL I HSL I EESQNQQEKNEQELLE - Z 

X-YTSLIHSLIEESQNQQEKNEQELLEL-Z 

X- YTSLIHSLIEESQNQQEKNEQELLELD- Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDK-Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKW-Z 
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X-YTSLIHSLIEESQNQQEKNEQELLELDKWA-Z 

X- YTSLIHS LI EESQNQQEKNEQELLELDKWAS - Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWASL-Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLW-Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWN-Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNW-Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z . 

The one letter amino acid code is used. 
"X" may represent a hydrogen attached to the terminal amino 
group, an amino protecting group including, but not limited to, 
carbobenzoxyl, dansyl, or t-buiyloxycarbonyl; an acetyl group; a 
9-fluorenylmethoxy-carbonyl (FMOC) group; a macromolecular carrier 
group including, but not limited to, lipid-fatty acid conjugates, polyethylene 
glycol, or carbohydrates. 

"Z" may represent a terminal carboxyl (COOH); an amido group; 
an ester group (COOR) including, but not limited to, a t-butyloxycarbonyl 
group; a macromolecular carrier group including, but not limited to, lipid- 
fatty acid conjugates, polyethylene glycol, or carbohydrates. 
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TABLE II 

Amino Truncations of SEQ ID NO:5 

X-NWF-Z 

X-WNWF-Z 

X-LWNWF-Z 

X-SLWNWF-Z 

X - AS LWNWF -Z 

X-WASLWNWF-Z 

X-KWASLWNWF-Z 

X-DKWASLWNWF-Z 

X - LDKWAS LWNWF - Z 

X - ELDKWAS LWNWF - Z 

X - LELDKWAS LWNWF - Z 

X-LLELDKWASLWNWF-Z 

X -ELLELDKWASLWNWF - Z 

X -QELLELDKWAS LWNWF - Z 

X - EQELLE LDKWAS LWNWF - Z 

X-NEQELLELDKWASLWNWF - Z 

X - KNEQ ELLELDKWAS LWNWF - Z 

X - EKNEQELLELDKWAS LWNWF - Z 

X - Q EKNEQELLELDKWAS LWNWF - Z 

X- QQEKNEQELLELDKWAS LWNWF - Z 

X-NQQEKNEQELLELDKWASLWNWF-Z 

X- QNQQEKNEQELLELDKWASLWNWF - Z 

X - S QNQQEKNEQELLELDKWAS LWNWF - Z 

X - ESQNQQEKNEQELLELDKWASLWNWF - Z 

X - EESQNQQEKNEQELLELDKWASLWNWF -Z 

X-IEESQNQQEKNEQELLELDKWASLWNWF-Z 

X-LIEESQNQQEKNEQELLELDKWASLWNWF-Z 

X-SLIEESQNQQEKNEQELLELDKWASLWNWF-Z 

X-HSLIEESQNQQEKNEQELLELDKWASLWNWF-Z 

X-IHSLIEESQNQQEKNEQELLELDKWAS LWNWF -Z 

X-LIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z 

X-SLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z 
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X - TS L I H S LI EESQNQQEKNEQELLELDKWASLVJNWF - Z 
X -YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z 

The one letter amino acid code is used. 
| "X" may represent a hydrogen attached to the terminal amino 

group, an amino protecting group including, but not limited to, 
carbobenzoxyl, dansyl, or t-butyloxycarbonyl; an acetyl group; a 
9-fIuorenylmethoxy-carbonyl (FMOC) group; a macromolecular carrier 
group including, but not limited to, lipid-fatty acid conjugates, polyethylene 
glycol, or carbohydrates. 

"Z" may represent a terminal carboxyl (COOH); an amido group; 
an ester group (COOR) including, but not limited to, a t-butyloxycarbonyl 
group; a macromolecular.carrier group including, but not limited to, lipid- 
fatty acid conjugates, polyethylene glycol, or carbohydrates. 

The peptides also include analogs of which may include, but are not limited to, 
peptides comprising the a full-length or truncated sequence, containing one or more amino 
acid substitutions, insertions and/or deletions. 

There exists a striking amino acid conservation within the C-helical regions of 
HIV-1 and HIV-2. The amino acid conservation is of aperiodic nature, suggesting some 
conservation of structure and/or function. A useful peptide derived from the HIV-2^ 
isolate has the 36 amino acid sequence (reading from amino to carboxy terminus): 

NH r LEAMSQSLEQAQIQQEKNMYE^^ (SEQ ID NO:5) 

Further, peptides useful for forming "entry-relevant" structures include peptides 
corresponding to the N-helical domain of gp41. One example of such a peptide, P-17, 
corresponds to residues 558 to 595 of the transmembrane protein gp4 1 from the HIV- 
isolate. 

In addition to the full-length N-helical peptides (for example, (SEQ ID NO:l)) 
shown above, the peptides may include truncations of these" peptides which exhibit the 
ability to form stable coiled-coil structure. Such truncated peptides may comprise 
peptides of between 3 and 55 amino acid residues, i.e., peptides ranging in size from a 
tripeptide to a 55-mer polypeptide, as shown in Tables III and IV, below for P-17. 
Peptide sequences in these tables are listed from amino (left) to carboxy (right) terminus. 
"X" may represent an amino group (-NH 2 ) and "Z" may represent a carboxyl (-COOH) 
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group. Alternatively, M X" and/or "Z" may represent a hydrophobic group, an acetyl 
group, a FMOC group, an amido group or a covalently attached macromolecular group. 



TABLE III 

Carboxy Truncations of SEQ ID NO:2 

X-NNL-Z 

X-NNLL-Z 

X-NNLLR-Z 

X-NNLLRA-Z 

X-NNLLRAI -Z 

X-NNLLRAIE-Z 

X-NNLLRAIEA-Z 

X-NNLLRAIEAQ-Z 

X-NNLLRAIEAQQ-Z 

X-NNLLRAI EAQQH - Z 

X -NNLLRAI EAQQHL - Z 

X -NNLLRAI 33AQQHLL - Z 

X-NNLLRAI EAQQHLLQ - Z 

X -NNLLRAI EAQQHLLQL - Z 

X-NNLLRAIEAQQHLLQLT-Z 

X-NNLLRAI EAQQHLLQ LTV -Z 

X-NNLLRAI EAQQHLLQLTVW-Z 

X -NNLLRAI EAQQHLLQLTVWQ - Z 

X -NNLLRAI E AQQHLLQLTVWQ I - Z 

X-NNLLRAI EAQQHLLQ LTVWQIK- Z 

X -NNLLRAI EAQQHLLQLTVWQ I KQ-Z 

X -NNLLRAI EAQQHLLQLTVWQ I KQL-Z 

X -NNLLRAI EAQQHLLQLTVWQ I KQLQ - Z 

X -NNLLRAI EAQQHLLQLTVWQ I KQLQA- Z 

X- NNLLRAI EAQQHLLQ LTVWQIKQLQAR-Z 

X-NNLLRAIEAQQHLLQLTVWQIKQLQARI - Z 
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X-NNLLRA I EAQQHLLQLTVWQ I KQLQARI L - Z 
X-NNLLRA I EAQQHLLQLTVWQ I KQLQARI LA- Z 
X -NNLLRAI EAQQHLLQLTVWQ I KQLQARI LAV- Z 
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVE-Z 
X - NNLLRAI EAQQHLLQLTVWQ I KQLQARI L AVER - Z 
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERY-Z 
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYL-Z 
X -NNLLRAI EAQQHLLQLTVWQ I KQLQAR I LAVERYLK-Z 
X -NNLLRAI EAQQHLLQLTVWQ I KQLQAR I LAVERYLKD- Z 
X-NNLLRAIEAQQHLLQLTVWQI KQLQARI LAVERYLKDQ-Z 

The one letter amino acid code is used. 

"X" may represent a hydrogen attached to the terminal amino 
group, an amino protecting group including, but not limited to, 
carbobenzoxyl, dansyl, or t-butyloxycarbonyl; an acetyl group; a 
9-fluorenylmethoxy-carbonyl (FMOC) group; a macromolecular carrier 
group including, but not limited to, lipid-fatty acid conjugates, 
polyethylene glycol, or carbohydrates. 

"Z" may represent a terminal carboxyl (COOH); an amido 
group; an ester group (COOR) including, but not limited to, a 
t-butyloxycarbonyl group; a macromolecular carrier group including, but 
not limited to, lipid-fatty acid conjugates, polyethylene glycol, or 
carbohydrates. 
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TABLE IV 



Amino Truncations of SEQ ID NO:2 



10 



15 



20 



25 



30 



X-KDQ-Z 
X-LKDQ-Z 
X-YLKDQ-Z 
X-RYLKDQ-Z 
X- ER YLKDQ -Z 
X -VER YLKDQ - Z 
X - AVER YLKDQ - Z 
X - LAVERYLKDQ - Z 
X - I LAVERYLKDQ - Z 
X - R I LAVERYLKDQ - Z 
X-AR I LAVERYLKDQ -Z 
X -QARI LAVERYLKDQ - Z 
X - LQAR I LAVERYLKDQ - Z 
X -QLQARI LAVER YLKDQ - Z 
X - KQLQAR I LAVERYLKDQ - Z 
X - I KQLQAR I LAVERYLKDQ - Z 
X - Q IKQLQARI LAVERYLKDQ - Z 
X - WQ I KQLQAR I LAVERYLKDQ - Z 
X - VWQ I KQLQ ARI LAVERYLKDQ - Z 
X-TVWQ I KQLQAR I LAVERYLKDQ -Z 
X - LT VWQ I KQLQAR I LAVE R YLKDQ - Z 
X - Q LTVWQ I KQLQAR I LAVERYLKDQ - Z 
X-LQLTVWQ I KQLQAR I LAVERYLKDQ -Z 
X - LLQLTVWQ IKQLQARI LAVERYLKDQ - Z ' 
X - HLLQLTVWQ I KQLQARI LAVERYLKDQ - Z 
X - QHLLQLTVWQ I KQLQARI LAVERYLKDQ - Z 
X - QQHLLQLTVWQ IKQLQARI LAVER YLKDQ - Z 
X - AQQHLLQLTVWQ I KQLQAR I LAVERYLKDQ - Z 
X - E AQQHLLQLTVWQ I KQLQAR I LAVERYLKDQ - Z 
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X - I EAQQHLLQLTVWQ I KQLQARI LAVERYLKDQ - Z 
X - AI EAQQHLLQLTVWQI KQLQARI LAVERYLKDQ - Z 
X-RAI EAQQHLLQ LTVWQ I KQ LQAR I LAVERYLKDQ - Z 
X - LRAI EAQQHLLQLTVWQ I KQLQARI LAVERYLKDQ - Z 
X - LLRA I EAQQHLLQLTVWQI KQLQAR I LAVERYLKDQ- Z 
X -NLLRAI EAQQHLLQLTVWQ IKQLQARI LAVERYLKDQ - Z 
X-NNLLRAI EAQQHLLQLTVWQI KQLQARI LAVERYLKDQ- Z 

The one letter amino acid code is used. 

"X" may represent a hydrogen attached to the terminal amino 
group, an amino protecting group including, but not limited to, 
carbobenzoxyl, dansyl, or t-butyloxycarbonyl; an acetyl group; a 
9-fluorenyImethoxy-carbonyl (FMOC) group; a macromolecular carrier 
group including, but not limited to, lipid-fatty acid conjugates, 
polyethylene glycol, or carbohydrates. 

"Z" may represent a terminal carboxyl (COOH); an amido 
group; an ester group (COOR) including, but not limited to, a 
t-butyloxycarbonyl group; a macromolecular carrier group including, but 
not limited to, lipid-fatty acid conjugates, polyethylene glycol, or 
carbohydrates. 

The N-helical peptides also include analogs and/or truncations which may 
include, but are not limited to, peptides comprising the full-length or a truncated 
sequence, containing one or more amino acid substitutions, insertions and/or deletions. 



Antibody Generation and Characterization 



Generation and characterization of the antibodies generated against novel gp41 
epitopes constitutes the second aspect of the invention. The experimental sera and 
monoclonal antibodies generated against the target immunogens are subjected to 
thorough biophysical and biological evaluation. 

Antibodies are generated following established protocols. All small animal work 
(immunizations, bleeds, and hybridoma production) is carried out by standard methods 
known to those of skill in the art. A first set of immunogens consists of the peptide 
constructs P-15 or P-17 (capable of forming trimeric coiled-coil multimers, optionally 
stabilized by chemical cross-linking or oxidation), P-16 or P-18, and the P-17/P-18 
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mixture or P-15/P-16 mixture (wherein the peptides are optionally chemically or 
oxidatively cross-linked). In one set of experiments, the immunogens are conjugated to 
a carrier such as KLH. 

Balb-c mice are immunized with each of these constructs. Due to possible 

5 disruptive effects of carrier conjugation on antigen structure, one group of mice from 
each set can be immunized with 100 jug of unconjugated peptide, while another group 
of mice can receive 100 |ig of antigen conjugated to KLH. Following the initial 
immunization the animals receive a 100 fig boost on day 14 followed by 50 fig boosts on 
days 30 and 45. Bleeds occur two weeks following the final boost. Mice are also 

10 immunized with the recombinant constructs following the same outline as that for the 
peptide immunogens. 

Alternative immunization approaches include the use of a recombinant 
adenovirus vector expressing all or part of the HIV-1 envelope glycoprotein gpl20/gp41 
as the primary immunogen followed by booster immunizations with the gp41 peptides, 

1 5 proteins or other constructs. 

The polyclonal sera generated by the immunization of experimental animals 
undergo an initial screen for virus inhibition. Antiviral activity is evaluated in both 
cell-cell fusion and neutralization assays. In this second format, a representative sample 
of lab adapted and primary virus isolates is used. Both assays are carried out according 

20 to protocols described previously (Wild, C, et al, Proc. Natl. Acad. Sci. USA 89: 10537- 
10541 (1992); Wild, C, et al, Proc. Natl Acad ScL USA 97:12676-12680 (1994); 
Wild, C, et aL, Proc. Natl. Acad. Sci. USA 97:9770-9774 (1994)). Samples are also 
screened by ELISA to characterize binding. The antigen panel includes all experimental 
. immunogens. Animals with sera samples which test positive for binding to one or more 

25 experimental immunogens are candidates for use in MAb production. Following this 
initial screen, one animal representing each experimental immunogen is selected for 
monoclonal antibody production. The criteria for this selection is based on neutralizing 
antibody titers and in the absence of neutralization, binding patterns against the panel of 
structured immunogens. 

30 Hybridoma supernatants are screened by ELISA, against structured and 

non-structured peptides and recombinants. Samples that are ELISA negative or weakly 
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positive are further characterized for IgG. If IgG is present the material is screened in 
the biophysical and biological assays. Strongly positive samples are screened for their 
ability to neutralize virus and bind envelope. The experimental material can be further 
tested against a panel representing the spectrum of HIV-1 isolates. These isolates 
5 include lab adapted and primary virus strains, syncytium and non-syncytium inducing 
isolates, virus representing various geographic subtypes, and viral isolates which make 
use of the range of second receptors during virus entry. These neutralization assays 
employ either primary cell and cell line targets as required. 

Antibodies are characterized in detail for their ability to bind HIV envelope under 

10 various conditions. It is another object of the invention to determine the gp41 target 
epitopes are exposed on native envelope or if the envelope must first undergo some 
interaction which triggers a conformational change Le binding CD4 and/or co-receptor 
in order to expose these epitopes. For detection of antibody binding to native envelope, 
immunoprecipitations on Env-expressing cells and virions, both intact and lysed are 

15 performed using non-ionic detergents (Furata, RA et aL, Nat. Struct. Biol 5(4J:276-279 
(1997); White, J. M. and I. A. Wilson, J. Cell Biol 705:2887-2894 (1987); Kemble, G. 
W., et aL, J. Virol (56:4940-4950 (1992)). Antibody binding to cell lysates and intact 
virions are also assayed in an ELISA format. Flow cytometry experiments are 
performed to determine binding to envelope expressing cells. Cross-competition 

20 experiments using other mapped Mabs, human sera, and peptides can also be performed. 
To characterize "triggers" to the conformational change, antibody binding to virus in the 
presence and absence of both sCD4 and target cells can be compared (White, J. M. and 
L A. Wilson, 1 Cell Biol 705:2887-2894 (1987); Kemble, G. W., et al t 7. Virol 
66:4940-4950 ( 1 992)). Because the gp4 1 regions are highly conserved, epitope exposure 

25 using several different envelopes can be compared to discern possible differences in 
structure between primary, lab-adapted and genetically diverse virus isolates. 

Pharmaceutical Compositions and Methods of Using 

The immunogenic constructs of the present invention can be employed in 
vaccines in an amount effective depending on the route of administration. Although 



WO 00/40616 



-41- 



PCT/US00/00456 



subcutaneous or intramuscular routes of administration are preferred, peptides, multimers 
or peptide conjugates of the present invention can also be administered by an 
intraperitoneal or intravenous route. One skilled in the art will appreciate that the 
amounts to be administered for any particular treatment protocol can be readily 

5 determined without undue experimentation. 

The vaccines of the present invention may be employed in such forms as 
capsules, liquid solutions, suspensions or elixirs for oral administration, or sterile liquid 
forms such as solutions or suspensions. Any inert carrier is preferably used, such as 
saline, phosphate-buffered saline, or any such carrier in which the conjugate vaccine has 

10 suitable solubility properties. The vaccines may be in the form of single dose 
preparations or in multi-dose flasks which can be used for mass vaccination programs. 
Reference is made to Remington's Pharmaceutical Sciences, Osol, ed., Mack Publishing 
Co., Easton, PA (1980), and New Trends and Developments in Vaccines, Voller, et al y 
eds., University Park Press, Baltimore, MD (1978), for methods of preparing and using 

15 vaccines. 

The vaccines of the present invention may further comprise adjuvants which 
enhance production of HIV-specific antibodies. Such adjuvants include, but are not 
limited to, various oil formulations such as Freund's complete adjuvant (CFA), stearyl 
tyrosine (ST, see U.S. Patent No. 4,258,029), thedipeptide known as MDP, saponins and 

20 saponin derivatives, such as Quil A and QS-21, aluminum hydroxide, and lymphatic 
cytokine. Preferably, an adjuvant will aid in maintaining the secondary and quaternary 
structure of the immunogens. 

Freund's adjuvant is an emulsion of mineral oil and water which is mixed with 
the immunogenic substance. Although Freund's adjuvant is powerful, it is usually not 

25 administered to humans. Instead, the adjuvant alum (aluminum hydroxide) or ST may 
be used for administration to a human. The vaccine may be absorbed onto the aluminum 
hydroxide from which it is slowly released after injection. The vaccine may also be 
encapsulated within liposomes according to Fullerton, U.S. Patent No. 4,235,877, or 
mixed with or liposomes or lipid mixtures to provide an environment similar to the cell 

30 surface environment. 
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In another preferred embodiment, one or more immunogens of the invention are 
combined with other immunogens that are used to vaccinate animals. 

In another preferred embodiment, the present invention relates to a method of 
inducing an immune response in an animal comprising administering to the animal the 
vaccine of the invention in an amount effective to induce an immune response. 
Optionally, the vaccine of the invention may be coadministered with effective amounts 
of other immunogens as mentioned above to generate multiple immune responses in the 
animal. 

Compositions of the invention are useful as vaccines to induce active immunity 
towards antigens in subjects. Any animal that may experience the beneficial effects of 
the compositions of the present invention within the scope of subjects that may be 
treated. The subjects are preferably mammals, and more preferably humans. 

The administration of the vaccine may be for either a "prophylactic" or "therape- 
utic" purpose. When provided prophylactically, the vaccine(s) are provided in advance 
of any symptoms of HIV infection, or in advance of any known exposure to HIV. The 
prophylactic administration of the vaccine(s) serves to prevent or attenuate any 
subsequent infection. When provided therapeutically, the vaccine(s) is provided upon or 
after the detection of symptoms which indicate that an animal may be infected with HIV, 
or upon or after exposure to the virus. The therapeutic administration of the vaccine(s) 
serves to attenuate any actual infection, for example as measured by improving the 
symptoms of a subject, or by reducing the level of viral replication. Thus, the vaccines, 
may be provided either prior to the onset of infection (so as to prevent or attenuate an 
anticipated infection) or after the initiation of an actual infection. 
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Example 1 

Immunogens of the Present Invention Elicit a Neutralizing Antibody Response to 
Entry-Relevant Structures on HIV-1 gp41 

Materials and Methods 

Polyclonal sera can be obtained by immunizing rabbits or guinea pigs using 
methods well known to those skilled in the art. For example, the animals are immunized 
at multiple sites (sub-cutaneous and sub-clavicular) with a total of 200 pg (rabbits) or 
100 pg (guinea pigs) of the appropriate peptide, protein, combination of construct in 
complete Freund's adjuvant. This is followed by two booster immunizations with the 
same immunogen in incomplete Freund's adjuvant at monthly intervals following the 
primary immunization. Sera are collected prior to, and at intervals following, the series 
of immunizations. These sera are analyzed for the presence of antibodies to the 
immunogen or other antigens by various assays including those described below. 

Peptide ELISA: Antigen was coated onto 96-well microtiter plates (Immulon 2) at 1 
pg/well. Following overnight incubation at 4 °C, plates were washed, blocked and test 
sera was added. After a 1.5 hr incubation, plates are washed and bound antibody is 
detected by addition of phosphatase-conjugated secondary antibody and development by 
pNPP. 

Dot Blots: Antigen (2 |ig) was blotted onto nitrocellulose, blocked and allowed to air 
dry. Blots were incubated 4 hr with test sera at a 1: 100 dilution. A secondary antibody 
peroxidase/TMB detection system was used. 

Western Blots: Immunoblots were carried out using commercially available strips 
(Organon Teknika) with test sera at 1:100. 

Viral Lysate Immunoprecipitation: HIV- 1 infected cells (IHB/H9) were lysed and 
mixed with immune sera at a dilution of 1:100. Following incubation with Protein A- 
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agarose, immunoprecipitates were separated by SDS-PAGE and probed with a gp41 
specific monoclonal antibody." 

Cell Surface Immunoprecipitation: Two days post transfection, 1.5 x 10 7 envelope 
expressing 293T cells were incubated with experimental sera with and without sCD4 
5 (10fig/ml unless otherwise noted). Following incubation, cells were lysed and incubated 
with Protein A-agarose. Immunoprecipitates were separated by SDS-PAGE and probed 
with a gp41 specific monoclonal antibody. 

Neutralization Assay: Test sera was incubated at a 1 : 10 dilution with indicated amount 
of virus (HIV-1 IIIB) for lhr at 37 °C. At the end of this time target cells were added 
10 (CEM) and the experiment was returned to the incubator. On days 1, 3 and 5, post- 
infection complete media changes were carried out. On day 7 PI culture supernatant 
were harvested. Levels of virus replication were determined by p24 antigen capture. 
Levels of replication in test wells were normalized to virus only controls. 

Results 

1 5 Rabbits or guinea pigs were immunized and sera analyzed by methods described 

above. The following data describe the characterization of polyclonal antibodies 
generated to various immunogens that are the subject of this invention. 

Table V illustrates results of the analysis of polyclonal sera to various 
immunogens analyzed by peptide ELISA or dot blots. Several immunogens elicited a 

20 strong antibody response in these assays. For example, immunization with P15 resulted 
in sera with strong antibody reactivity to PI 5 by peptide ELISA (titer >1: 102400), and 
strong reactivity to PI 5, a mixture of P15 + P16 and HIV-1 gp41 by dot blot. Similar 
results were obtained in these assays following immunization with a mixture of PI 5 and 
PI 6 (Table V). 

25 Description of Table V: Analysis of polyclonal sera to various immunogens by 

peptide ELISA or dot blot. For this and subsequent figures all results are based on 
immunizations of rabbits except for immunizations with P-17 or P-18 alone which were 
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performed in guinea pigs. The immunogens used are indicated in the vertical list on the 
left side of the table. The antigens used in each assay are indicated on the top row of the 
table. Peptide ELISA results are presented as titers (the maximum dilution that gives a 
positive result in the assay). Dot blot results are scored from - (no reactivity) to +++ 
(very strong reactivity). HIVTMisHIV-1 gp41. For Table V, *BS 3 refers to chemically 
cross linked; and ND indicates "not determined." 
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These results were confirmed and extended by analysis of the polyclonal sera for 
reactivity with HIV-1 gp!20, gp41 or gpl60 by western blot or immunoprecipitation 
(Table VI). For example, immunization with P15orP15 + P16 elicited antibodies that 
reacted with gpl60 by western blot, and which precipitated gp41 in infected cell lysates. 
Of particular interest, P15 + P16 elicited an immune response that reacted with cell 
surface gp41, but only following treatment of the cells with sCD4 (Figure 4). Previous 
reports have found that sCD4 binds to gpl20 resulting in conformational changes in 
gpl20/gp41 or stripping of gpl20 from gp41. This process presumably mimics events 
that occur at attachment of HIV- 1 to its receptor CD4 on target cells. The present results 
suggest that immunization with the mixture of P15 + P16 elicits an immune response to 
cryptic epitopes on gp41 that are only exposed following binding of gpl20 to CD4. 
Table VI: Analysis of polyclonal sera to various immunogens by western blot or 
immunoprecipitation. The immunogens used are indicated in the vertical list on the left 
side of the table. The antigens used in each assay are indicated on the top row of the 
table. Results are scored from - (no reactivity) to ++++ (very strong reactivity), w: weak 
reactivity; * : BS 3 chemically cross-linked prior to administration; ND: not determined; 
HIVTM:HIV-1 gp4L 
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Figure 5 provides data demonstrating that immunogens of the present invention 
elicit a neutralizing antibody response. While some non-specific inhibition of HIV-l 
replication is seen following incubation with pre-bleed sera, considerably greater 
inhibition is seen following incubation with sera from animals immunized with Pl5 or 
P 1 5 + P 1 6. These results indicate that these sera contain neutralizing antibodies resulting 
from immunization with the immunogen of, and by the methods of, the current 
invention. 

These data are supported by the fact that monoclonal antibodies have been 
generated in mice to several of the immunogens discussed above. When analyzed by 
some of the methods described above similar results were obtained to those seen with the 
polyclonal sera (not shown). 
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Discussion 

The structural components of gp41, which are present only during virus entry, 
form a novel set of neutralizing epitopes. The relatively short lived nature of these entry 
relevant structures and their presence only during natural infection would account for the 
observation that neutralizing antibodies targeting gp41 epitopes are poorly represented 
in sera from HIV infected individuals and all but absent in vaccinee sera. This theory 
is supported by work involving synthetic peptides which model the regions of gp41 
identified as taking part in the entry related structural reorganization (Wild, C, et aL, 
Proc. Natl. Acad, Sci. USA 59:10537-10541 (1992); Wild, C, et al t AIDS Res. Hum. 
Retroviruses 9: 105 1-1053 (1993); Wild, C, etal, Proc. Natl Acad. Sci. USA 91: 12676- 
12680 (1994); Wild, C, et aL AIDS Res. Hum. Retroviruses 77:323-325 (1995); Wild, 
C, et aL, Proc. Natl Acad. ScL USA 91:9110-911 A (1994)). It has been shown that these 
materials inhibit HIV infection by blocking virus entry and the mechanism of their 
activity is their ability to interact with and disrupt gp41 structural components critical 
to the entry event. Although transitory, these gp41 entry structures are both accessible 
and appropriately sensitive targets for neutralizing antibody. 

Independent of their neutralizing potential, monoclonal antibodies targeted to 
conserved structures in the TM will prove invaluable as reagents for dissecting the 
structural transitions that occur in Env as part of virus entry. 

We have been successful in our initial attempt to generate a structure specific 
antibody against the coiled-coil region of gp41. In this work we used a modified form 
of the P-17 peptide as immunogen and generated MAbs that recognize the structured 
peptide but not a proline containing P-17 analog which is unstructured. Also, this 
antibody can co-immunoprecipitate the P-18 peptide. 



\ 
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Example 2 

Neutralizing Antibody Response to Peptides Modeling 
the C-helical Region of gp41 

This example measures the humoral response to antigens modeling the C-region 
5 of gp41. This work used synthetic peptides and a recombinant form of viral protein to 
characterize antibodies raised against the C-helical regions of gp41 of the viral TM. 

These studies employ antibody binding assays to determine the ability of these 
materials to generate an immune response to various forms of envelope (native vs. 
denatured) and virus neutralization assays to characterize the antibody response raised 
10 against these gp41 domains. The complete panel of immunogens has generated data 
which allow new insight into the antigenic nature of gp4 1 . Most encouraging have been 
the results from Guinea Pigs immunized with the peptide, P-18, modeling the C-helix 
entry domain (amino acid residues 643-678 of gp41). Specifically, two of three animals 
receiving this material exhibited a neutralizing antibody response against divergent virus 
15 isolates in a variety of assay formats. Additional studies have confirmed these results. 
See Example 3. 

In study 1, guinea pigs were immunized intramuscularly with 100 pg of P-18 
formulated in either Freund's complete (prime) or incomplete (boost) adjuvant. Animals 
were immunized on days 0, 21, 34, 48 and 62. Blood was collected on days 44, 58 and 

20 72. In the initial screen, sera at at 1:10 dilution were tested for ability to inhibit virus- 
induced cell killing. In these assays two of the three animals receiving the P-18 peptide 
(guinea pigs gp233 and gp234) were able to block the cytopathic effects of a pair of 
prototype HIV-1 isolates. Against the MN isolate >80% protection was achieved while 
against RF protection was >50%. 

25 In an assay employing the same format (against HIV-1 MN ), we titrated the sera 

from gp233 and gp234. As can be seen in Figure 6a, these animals displayed the 
expected dose-related anti-viral activity. Guinea pigs 233 and 234 gave a 50% ^reduction 
in virus-induced cell killing at 1:40 and 1:37 dilutions, respectively. 

A neutralization assay was carried out employing a different target cell and 

30 endpoint analysis. In this format, CEM T-cell line was inoculated with 200 TCID 50 of 
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the HIV-1 MN isolate. The reduction in viral replication for gp233 and gp 234 at a serum 
dilution of 1:10 is shown in Figure 6b. 

Figure 6a shows the titration of bleed 2 for each animal against HIV-1 MN in the 
cell killing assay which uses cell viability as a measure of virus neutralization. MT2 
cells are added to a mixture of virus (sufficient to result in greater than 80% cell death 
at 5 days post infection) and sera which had been allowed to incubate for approximately 
1 hr. After 5 days in culture, cell viability is measured by vital dye metabolism. Figure 
6b shows the percent neutralization for each bleed at a 1: 10 dilution against HIV-1^ in 
an assay format employing CEM targets and p24 endpoint. In this assay, sera are 
incubated with 200 TCID 50 of virus for 1 hr prior to the addition of cells. On days 1, 3, 
and 5 media are changed. On day 7 culture supernatants are collected and analyzed for 
virus replication by p24 antigen levels. In each assay format, percent neutralization is 
determined by comparison of experimental wells with cell and cell/virus controls. 

The pattern of virus neutralization observed in the previous assays is repeated. 
At this serum dilution, bleed # 2 for guinea pigs 233 and 234 gave 80% and 90% virus 
neutralization, respectively. The same pattern of results was observed against the HIV- 
1 SF2 isolate where under identical assay conditions bleed # 2 from animals 233 and 234 
gave 70% and 50% neutralization. Control animals receiving adjuvant only exhibited 
no neutralizing activity. 

These sera neutralize the HIV- 1 isolates MN, RF, and SF2. These results indicate 
a breadth of activity unseen in most other subunit immunogens. By comparison, sera 
generated against V3 peptides are restricted in their activity to a small set of very closely 
related isolates. Due to the nature of the experiment the low antibody titers are not 
unexpected. These animals were immunized with free peptide formulated in Freund's 
adjuvant. Neither carrier molecules nor accessory proteins were used to enhance the 
immune response to this molecule. Results from binding assays indicate low but 
appreciable levels of antibody against viral envelope. 

In ELIS A assays using recombinant gp4 1 endpoint titers of 1 :6400- 1 :44,800 were 
observed for these samples. Linking P-18 to KLH (or other carrier molecules) and/or 
administering the conjugate in an adjuvant designed to enhance the immunogenicity of 
subunit antigens is expected to result in a significant increase in neutralizing response. 
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Example 3 

In a second study, 2 out of 3 animals immunized with P- 18 neutralized the HIV-1 
MN isolate in the assay using the MT2 cell line. 

animal neutSO titer 

5 BT004 1:21 

BT005 1:14 

Also, one animal receiving P-18 coupled to KLH neutralized the MN isolate in 
the same assay format. 

animal neutSO titer 

10 BT007 1:15 

Example 4 

The peptide used to generate the immune response in Example 2 includes within 
its sequence the linear epitope for the 2F5 monoclonal antibody. To determine if our 
immune response was against this same region of envelope, or involved a previously 

15 unidentified neutralizing epitope, a series of binding experiments was carried out to 
characterize the reactivity of our polyclonal sera. As can be seen in Table 1 , at a dilution 
of 1 : 100 all animals exhibit good ELISA binding to the cognate immunogen (P-18). Sera 
from these animals also have substantial antibody titers against a peptide derived from the 
N-terminal P-18 sequence, PI (Table VII). However, when tested at this same dilution 

20 against a pair of C-terminal P- 1 8 analogs, P2 and P3 (Table VII) no ELISA reactivity was 
observed (Table VET). This result is significant in that the P3 peptide includes the linear 
binding region (ELDKWAS) for the 2F5 monoclonal antibody. These results demonstrate 
that the neutralizing activity in our sera is not due to binding to the 2F5 epitope. 
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Table VII 

Set of three overlapping peptides corresponding to the P-18 peptide 

PI YTSLIHSLIEESQNQQEK (SEQIDNO:77) 

P2 EESQNQQEKNEQELLELD (SEQIDNO:78) 

5 P3 LELDKWASLWNWF 

(SEQ ID NO:79) 

P-18 YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF (SEQIDN0:5) 

Table VIII 

ELISA binding at 1:100 (OD) 

10 Sample Pi P2 P3 PAS 

gp232-2 0.833 0.124 0.003 1.423 

gp232-3 0.858 0.022 0.009 1.067 

gp233-2 1.024 0.019 0.010 1.314 

gp233-3 0.885 0.015 0.015 1.161 

15 gp234-2 0.492 0.015 0.016 1.152 

gp234-3 0.796 0.012 0.009 0.913 

ELISA binding by guinea pig sera to P-18 and a set of overlapping peptides 
corresponding to P-18. The majority of the antibody binding is to P-18 and the 
N-terminal peptide PI. Very little or no reactivity is observed against P2 and P3 
20 modeling the C-terminal region of P-18. 
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Example 5 

Expression of Recombinant gp41 Construct 

The plasmid for expression of the construct containing the N- and C-helical 
domains of HIV-1 gp41 separated by a short linker sequence (See FIG. 7) was prepared 

5 as follows. The bacterial expression vector pTCLE-ssG2C, (based on pAED-4, a T7 
expression vector developed specifically for the expression of small proteins) provided 
by Dr. Terrance Oas, Duke University was digested at the unique restriction sites Ndel 
and Ecorl and gel purified using the Qiaex system. The DNA fragments encoding the N- 
and C-helical regions of HIV- 1 gp41 and a short linker sequence were PGR amplified 

10 by standard techniques from gp41 expression vectors using the following primers. 
N-helix primer pair: 

5'; 5' GGG CCC ATA TGG GTA TTG TTC AGC AG 3' (includes Ndel site), 

(SEQ ID NO:80) 

y ; y GGG CCG GCG CCT GAG CCG CCG CCT TG A TCC TTC AGG TAG CGT TC 
15 3' (includes Narl site). (SEQ ID NO:81) 

C-helix primer pair: 

5'; 5' GGG CCG GCG CCG GCT CAG AGT GGG ACA GAG AAA TTA ACA ATT 
AC 3' (includes Narl site), (SEQ ID NO:82) 

3'; 5* GGG CCG AAT TCT TAA AAC CAA TTC CAC AAA CTT GCC CAT TT 3' 

20 (includes Ecorl site and a stop codon). (SEQ ID NO:83) 

These fragments were inserted (blunt end ligation) into the TA vector which was 
amplified to generate larger amounts of DNA. The fragments coding for to the N and C- 
heliees were released from the TA vector by restriction digest (C-helix: Narl and EcoRI, 
N-helix: Ndel and Narl) and gel purified. A three-way ligation was performed using 

25 standard procedures to introduce the DNA coding for the N- and C-helical fragments into 
the pTCLE-ssG2C vector. The product of this step was characterized by restriction 
digestion and DNA sequencing. The vector containing the desired gp4 1 coding region was 
prepared in large quantity and BL-21 E. coli host cells were transformed and induced to 
express the desired protein. The desired proteins may or may not have a methionine as 
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the first amino acid at he N-terminus. Over-expression of a protein of the appropriate 
molecular weight was observed by SDS-Page gel electrophoresis. 

Although the foregoing refers to particular preferred embodiments, it will be 
understood that the present invention is not so limited. It will occur to those of ordinary 
5 skill in the art that various modifications may be made to the disclosed embodiments and 
that such modifications are intended to be within the scope of the present invention, which 
is defined by the following claims. 

All publications, patents and patent applications mentioned in this specification are 
indicative of the level of skill of those in the art to which the invention pertains. All 
10 publications, patents and patent applications are herein incorporated by reference to the 
same extent as if each individual publication or patent application was specifically and 
individually indicated to be incorporated by reference in their entirety. 
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What Is Claimed Is: 

1. A method of raising a broadly neutralizing antibody response to HIV, 
comprising: 

administering to a mammal a peptide or polypeptide comprising an amino acid that 
is capable of forming a stable coiled-coil solution structure corresponding to or mimicking 
the heptad repeat region of gp41, or a fragment thereof. 

2. The method of claim 1 , wherein a peptide is administered, and wherein said 
peptide comprises about 28 to 55 amino acids of the following sequence: 
ARQU^GIVQQQNNLXRAEAQ 

(SEQ. ID NO:l), or multimers thereof. 

3. The method of claim 2, wherein the peptide is conjugated to a carrier 

protein. 

4. The method of claim 3, wherein said carrier protein is keyhole limpet 
hemocyanin (KLH), ovalbumin, bovine serum albumin (BS A) or tetanus toxoid. 

5. The method of claim I, wherein said peptide is one of SEQ ID NO: 1, 
SEQ ID NO: 2, SEQ ID NO: 3, or one of SEQ ID NO: 9 through SEQ ID NO: 40; 
and wherein the peptide can be optionally coupled to a larger carrier protein, or optionally 
include a terminal protecting group at the N- and/or C- termini. 

6. The method of claim 1 , wherein said peptide has the formula, from amino 
terminus to carboxy terminus, of: 

^ 2 -NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-COOH 

(SEQ ID NO: 1); 

or: 

NH 2 -SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARIL-COOH 

(SEQ ID NO:2); 
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and wherein the peptide can be optionally coupled to a larger carrier protein, or optionally 
include a terminal protecting group at the N- and/or C- termini. 

7. The method of claim 1, wherein said peptide includes one to 10 
conservative substitutions. 

8. A method of raising a broadly neutralizing antibody response to HIV, 
comprising: 

administering to a mammal a peptide or polypeptide comprising an amino acid 
sequence that corresponds to, or mimics, the transmembrane-proximal amphipathic a- 
helical segment of gp4 1 (at the C-helical domain of gp4 1 ), or a fragment thereof, wherein 
said mammal raises antibodies to a helical solution structure of said peptide or 
polypeptide. 

9. The method of claim 8, wherein a peptide is administered, and wherein said 
peptide comprises about 24-56 amino acids of the following sequence: 
WNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 
NITNW (SEQ ID NO:4), or a multimer thereof. 

10. The method of claim 8, wherein the peptide or polypeptide is conjugated 
to a carrier protein. 

11. The method of claim 10, wherein said carrier protein is keyhole limpet 
hemocyanin (KLH), ovalbumin, bovine serum albumin (BSA) or tetanus toxoid. 

12. The method of claim 8, wherein said peptide includes one to 10 
conservative substitutions. 

1 3. The method of claim 8, wherein said peptide one of SEQ ID NO: 4, SEQ 
ID NO: 5, SEQ ID NO: 6, or one of SEQ ID NO: 41 through SEQ ID NO: 74; 
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and wherein the peptide can be optionally coupled to a larger carrier protein, or optionally 
include a terminal protecting group at the N- and/or C- termini. 

1 4. The method of claim 8, wherein said peptide has the formula, from amino 
terminus to carboxy terminus, of: 

NH 2 -YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-COOH 

(SEQ ID NO:3); 

or: 

NH 2 -WMEWDREINNYTSLIHSLIEESQNQQEKNEQELL-COOH 

(SEQ ID NO:4) 

and wherein the peptide can be optionally coupled to a larger carrier protein, or optionally 
include a terminal protecting group at the N- and/or C- termini. 

15. A method of raising a broadly neutralizing antibody response to HIV, 
comprising: 

administering to a mammal a composition including one or more peptides or 
polypeptides which comprise amino acid sequences that are capable of forming solution 
stable structures that correspond to, or mimic, the gp41 core six helix bundle. 

16. The method of claim 15, wherein said one or more peptides or 
polypeptides comprise a mixture of C-helical peptide or polypeptide and N-helical peptide 
or polypeptide. 

17. The method of claim 15, wherein at least one of said peptides or 
polypeptides is multimeric, or is a conjugate structure comprised of an N-helical domain 
amino acid sequence and a C-helical domain amino acid sequence. 

18. The method of claim 15, wherein said mixture of C-helical peptide or 
polypeptide and N-helical peptide or polypeptide forms a stable core helix solution 
structure. 
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19. The method of claim 15, wherein said mixture comprises: 
P-17and P-18, 

P-15andP-16, 
P-17and P-16or 
P-15and P-18. 

20. A method of raising a broadly neutralizing antibody response to HIV, 
comprising: 

administering to a mammal a composition including one or more conjugate 
peptides or polypeptides formed from two or more amino acid sequences that comprise: 

(a) one or more amino acid sequences that are capable of forming a stable coiled- 
coil solution structure corresponding to or mimicking the heptad repeat region of 
gp41 (N-helical domain); and 

(b) one or more amino acid sequences that correspond to, or mimic, an amino acid 
sequence of the transmembrane-proximal amphipathic a-helical segment of gp41 
(C-helical domain); 

wherein 

said one or more sequences (a) and (b) are alternately linked to one another via 
a bond, such as a peptide bond (amide linkage) or by an amino acid linking sequence 
consisting of about 2 to about 25 amino acids. 

21. The method of claim 20, wherein said conjugates are recombinantly 
produced. 

22. The method of claim 2 1 , wherein one or more of said conjugates folds and 
assembles in solution into a structure corresponding to, or mimicking, the gp41 core six 
helix bundle. 

23. The method of claim 20, wherein: 

said N-helical peptide comprises about 28 to 55 amino acids of the following 
sequence: 
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ARQU^GrVQQQNNIXRAIEAQQHIJLQLTW 
(SEQ. ID NO: 1), or multimers thereof; and 

said C-helical peptide comprises about 24-56 amino acids of the following 
sequence: 

WNNMTWMEWDREINNYTSLIHSLEESQNQQEKNEQELLELDKWASLWNWF 
NITNW (SEQ ID NO:4), or multimers thereof. 

24. The method of claim 14 or claim 20, wherein: 

said N-helical peptide is one of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, 
or one of SEQ ID NO: 9 through SEQ ID NO: 40, and wherein the peptide can be 
optionally coupled to a larger carrier protein, or optionally include a terminal protecting 
group at the N- and/or C- termini; and 

said C-helical peptide is one of SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, 
or one of SEQ ID NO: 41 through SEQ ID NO: 74, and wherein the peptide can be 
optionally coupled to a larger carrier protein, or optionally include a terminal protecting 
group at the N- and/or C- termini. 

25. A conjugate peptide or polypeptide formed from two or more amino acid 
sequences that comprise: 

(a) one or more amino acid sequences that are capable of forming a stable coiled- 
coil solution structure corresponding to or mimicking the heptad repeat region of 
gp41 (N-helical domain); and 

(b) one or more amino acid sequences that correspond to, or mimic, an amino acid 
sequence of the transmembrane-proximal amphipathic a-helical segment of gp4 1 
(C-helical domain); 

wherein 

said one or more sequences (a) and (b) are alternately linked to one another via 
a bond, such as a peptide bond (amide linkage) or by an amino acid linking sequence 
consisting of about 2 to about 25 amino acids. 
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26. The conjugate of claim 25, wherein: 

said N-hclical peptide comprises about 28 to 55 amino acids of the following 
sequence: 

ARQLLSGIVQQQNNtXRAffi 
5 (SEQ. ID NO: 1), or multimers thereof; and 

said C-helical peptide comprises about 24-56 amino acids of the following 
sequence: 

WNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 
NITNW (SEQ ID NO:4), or multimers thereof. 

10 27. The conjugate of claim 25, wherein: 

said N-helical peptide is one of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, 
or one of SEQ ED NO: 9 through SEQ ID NO: 40, and wherein the peptide can be 
optionally coupled to a larger carrier protein, or optionally include a terminal protecting 
group at the N- and/or C- termini; and 

15 said C-helical peptide is one of SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, 

or one of SEQ ID NO: 41 through SEQ ID NO: 74, and wherein the peptide can be 
optionally coupled to a larger carrier protein, or optionally include a terminal protecting 
group at the N- and/or C- termini. 

28. A pharmaceutical composition comprising a conjugate of claim 25, and a 
20 pharmaceutical acceptable carrier. 

29. A composition comprising polyclonal or monoclonal antibodies that are 
raised to the conjugate of claim 25. 

30. A composition comprising a mixture of C-helical peptide or polypeptide 
and N-helical peptide or polypeptide, wherein said mixture forms a stable core helix 

25 solution structure. 
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3 1 . The composition of claim 30, wherein: 

said N-helical peptide comprises about 28 to 55 amino acids of the following 
sequence: 

ARQLI^GIVQQQNNLLR 

(SEQ. ID NO: 1), or multimers thereof; and 

said C-helical peptide comprises about 24-56 amino acids of the following 
sequence: 

WNNMTWMEWDREINNYTSLmSLEESQNQQEKNEQELLELDKWASLWNWF 
NITNW (SEQ ID NO:4), or multimers thereof. 

32. The composition of claim 30, wherein: 

said N-helical peptide is one of SEQ ID NO: I, SEQ ID NO: 2, SEQ ID NO: 3, 
or one of SEQ ID NO: 9 through SEQ ID NO: 40, and wherein the peptide can be 
optionally coupled to a larger carrier protein, or optionally include a terminal protecting 
group at the N- and/or C- termini; and 

said C-helical peptide is one of SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, 
or one of SEQ ED NO: 41 through SEQ ID NO: 74, and wherein the peptide can be 
optionally coupled to a larger carrier protein, or optionally include a terminal protecting 
group at the N- and/or C- termini. 

33. A composition comprising polyclonal or monoclonal antibodies that are 
raised to the composition of claim 30. 

34. A method of treatment, comprising: 

administering to an individual acomposition comprising polyclonal or monoclonal 
antibodies as claimed in claim 29 or claim 33 in an amount effective to reduce HIV 
infection of uninfected cells. 

35. An isolated nucleic acid molecule comprising a polynucleotide having a 
nucleotide sequence at least 95% identical to a sequence encoding a peptide or 
polypeptide conjugate of claim 25. 
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36. The nucleic acid molecule of claim 35, wherein said polynucleotide has the 
nucleotide sequence in FIG. 7. 

37. A method for making a recombinant vector comprising inserting an 
isolated nucleic acid molecule of claim 35 into a vector. 

38. A recombinant vector produced by the method of claim 37. 

39. A method of making a recombinant host cell comprising introducing the 
recombinant vector of claim 38 into a host cell. 

40. A recombinant host cell produced by the method of claim 39. 

4 L A recombinant method for producing a conj ugate peptide or polypeptide, 
comprising culturing the recombinant host cell of claim 40 under conditions such that said 
polypeptide is expressed and recovering said- polypeptide. 

42. The method of claim 1, claim 8, claim 15 or claim 20, wherein said 
administering is provided in advance of any symptoms of HIV infection, or in advance of 
any known exposure to HIV. 

43. The method of claim 1, claim 8, claim 15 or claim 20, wherein said 
administering is provided upon or after the detection of symptoms which indicate that an 
animal may be infected with HIV, or upon or after exposure to the virus. 
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FIG. 2 
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SEQUENCE LISTING 



<110> Wild, Carl T. 

Weiss, Carol D. 

<120> Methods of Eliciting Broadly Neutralizing Antibodies 
Targeting HIV-1 gp41 

<130> 1589.016PC01 

<140> 
<141> 

<150> 60/115, 404 
<151> 1999-01-08 

<150> to be assigned 
<151> 2000-01-07 

<160> 84 

<170> Patentln Ver . 2.1 

<210> 1 
<211> 55 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 1 

Ala Arg Gin Leu Leu Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu 
1 5 10 15 

Arg Ala He Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly 
20 25 30 

He Lys Gin Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys 
35 40 45 

Asp Gin Gin Leu Leu Gly He 
50 55 



<210> 2 
<211> 38 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 2 

Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His Leu Leu Gin Leu 
1 5 10 15 

Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg lie Leu Ala Val Glu 
20 25 30 

Arg Tyr Leu Lys Asp Gin 
35 



<210> 3 
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<211> 36 
<212> PRT 

<213> Artificial Sequence 
<220> 

<:223> Description of Artificial Sequence : Synthetic 
<400> 3 

Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala 
1 5 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg He Leu 
35 



<210> 4 
<211> 56 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 4 

Trp Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu He Asn Asn Tyr 
15 10 15 

Thr Ser Leu He His Ser Leu lie Glu Glu Ser Gin Asn Gin Gin Glu 
20 25 30 

Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp 
35 40 45 

Asn Trp Phe Asn He Thr Asn Trp 
50 55 



<210> 5 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 5 

Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn Gin Gin 
1 5 10 .15 

Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu 
20 25 ^30 

Trp Asn Trp Phe 

35 : 



<210> 6 
<211> 34 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 6 

Trp Met Glu Trp Asp Arg Glu He Asn Asn Tyr Thr Ser Leu He His 
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10 



15 



Ser Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu 
20 25 30 

Leu Leu 



<210> 7 
<211> 5 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 

<400> 7 
ggggs 



<210> 8 
<211> 345 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 8 

Ala Val Gly He Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly 
15 10 15 

Ser Thr Met Gly Ala Arg Ser Met Thr Leu Thr Val Gin Ala Arg Gin 
20 25 30 

Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He 
35 40 45 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin 
50 55 60 

Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin Gin 
65 70 " 75 * 80 

Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr Thr Ala 
85 90 95 

Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu Gin He Trp 
100 105 110 

Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu lie Asn Asn Tyr Thr 
115 120 125 

Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys 
130 135 140 

Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn 
145 150 155 160 

Trp Phe Asn He Thr Asn Trp Leu Trp Tyr He Lys He Phe He Met 
165 170 175 

He Val Gly Gly Leu Val Gly Leu Arg lie Val Phe Ala Val Leu Ser 
180 185 190 
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Ile Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr 
195 200 205 

His Leu Pro Thr Pro Arg Gly Pro Asp Arg Pro Glu Gly lie Glu Glu 
210 215 220 

Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser lie Arg Leu Val Asn Gly 
225 ^ 230 235 240 

Ser Leu Ala Leu lie Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser 
245 250 255 

Tyr His Arg Leu Arg Asp Leu Leu Leu He Val Thr Arg He Val Glu 
260 265 270 

Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys tyr Trp Trp Asn Leu 
275 280 285 

Leu Gin Tyr Trp Ser Gin Glu Leu Lys Asn Ser Ala Val Ser Leu Leu 
290 295 300 

Asn Ala Thr Ala He Ala Val Ala Glu Gly Thr Asp Arg Val He Glu 
305 310 315 320 

Val Val Gin Gly Ala Cys Arg Ala He Arg His He Pro Arg Arg He 
325 330 335 



Arg Gin Gly Leu Glu Arg He Leu Leu 
340 345 



<210> 9 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 9 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
1 5 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
35 40 45 



<210> 10 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 10 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala 
1 5 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 



Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Arg Asp Gin 
35 40 45 
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<210> 11 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 11 

Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala 
15 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu 
35 



<210> 12 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 12 

Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His Leu Leu Gin Leu 
1 5 10 15 

Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg Val Leu Ala Leu Glu 
20 25 30 

Arg Tyr Leu Arg Asp Gin 
35 



<210> 13 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 13 

Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala 
1 ' 5 10 15 

Gin Gin Arg Met Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu Ala Val Glu Arg. Tyr Leu Gly Asp Gin 
35 40 45 



<210> 14 
<211> 36 
<212> PRT . 

<213> Human immunodeficiency virus 
<400> 14 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
1 5 10 15 

Gin Gin Arg Met Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 



Ala Arg Val Leu 
35 
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<210> 15 » 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 15 

Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin Arg Met Leu Gin Leu 
15 10 15 

Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val Leu Ala Val Glu 
20 25 30 

Arg Tyr Leu Gly Asp Gin 
35 



<210> 16 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 16 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
15 10 15 

Gin Gin His Met Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Arg Asp Gin 
35 40 45 



<210> 17 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 17 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
15 10 15 

Gin Gin His Met Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu 
35 



<210> 18 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 18 

Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Met Leu Gin Leu 
15 10 15 

Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val Leu Ala Leu Glu 
20 25 30 

Arg Tyr Leu Arg Asp Gin 
35 
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<210> 19 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 19 

Ser Gly lie Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu Ala 
15 10 15 

Gin Gin His Met Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
* 20 25 30 

Ala Arg Val Leu Ala He Glu Arg Tyr Leu Arg Asp Gin 
35 40 45 



<210> 20 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 20 

Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu Ala 
1 5 10 15 

Gin Gin His Met Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu 
35 



<210> 21 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 21 

Ser Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His Met Leu Gin Leu 
1 5 10 15 

Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val Leu Ala He Glu 
20 25 30 

Arg Tyr Leu Arg Asp Gin 
35 



<210> 22 
<211> 45 
<212> PRT , 

<213> Human immunodeficiency virus 
<400> 22 . 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
1 '5 10 ' 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 



Ala Arg Val» Leu Ala Val Glu Ser Tyr Leu Lys Asp Gin 
35 40 45 
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<210> 23 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 23 

Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin, His Leu Leu Gin Leu 
1 5 10 15 

Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg Val Leu Ala Val Glu 
20 25 30 

Ser Tyr Leu Lys Asp Gin 
35 



<210> 24 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 24 

Ser Gly lie Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu Ala 
15 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Gin Asp Gin 
35 40 45 



<210> 25 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 25 

Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu Ala 
15 10 15 

\. 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu 

35 



<210> 26 
<211> 38 , 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 26 

Ser Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 
15 10 15 

Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val Leu Ala Val Glu 
20 25 30 

Arg Tyr Leu Gin Asp Gin 
35 
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<210> 27 

<211> 45 

<212> PRT 

<213> Human immunodeficiency virus 



<400> 27 

Ser Gly lie Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu Ala 
15 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Arg Asp Gin 
35 40 45 



<210> 28 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 28 

Ser Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 
15 10 15 

Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val Leu Ala Leu Glu 
20 25 30 

Arg Tyr Leu Arg Asp Gin 
35 



<210> 29 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 29 

Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Gin Ala 
15 10 15 

Gin Gin His Met Leu Gin Leu Thr Val Trp Gly Val Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
35 40 45 



<210> 30 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 30 

Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Gin Ala 
1 5 10 15 

Gin Gin His Met Leu Gin Leu Thr Val Trp Gly Val Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu 
35 
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<210> 31 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 31 

Ser Asn Leu Leu Arg Ala He Gin Ala Gin Gin His Met Leu Gin Leu 
15 10 15 

Thr Val Trp Gly Val Lys Gin Leu Gin Ala Arg Val Leu Ala Val Glu 
20 25 30 

Arg Tyr Leu Lys Asp Gin 
35 



<210> 32 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 32 

Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Lys Ala He Glu Ala 
15 10 15 

Gin Gin His Leu Leu Lys Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
35 40 45 



<210> 33 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 33 

Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Lys Ala He Glu Ala 
15 10 15 

Gin Gin His Leu Leu Lys Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu 
35 



<210> 34 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 34 

Ser Asn Leu Leu Lys Ala He Glu Ala Gin Gin His Leu Leu Lys Leu 
1 5 10 15 

Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val Leu Ala Val Glu 
20 25 30 

Arg Tyr Leu Lys Asp Gin 
35 
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<210> 35 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 35 

Ser Gly He Val Gin Gin Gin Asn He Leu Leu Arg Ala He Glu Ala 
1.5 10 15 

Gin Gin His Leu Leu Gin Leu Ser He Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Lys Val Leu Ala lie Glu Arg Tyr Leu Arg Asp Gin 
35 40 45 



<210> 36 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 36 

Ser Gly He Val Gin Gin Gin Asn He Leu Leu Arg Ala lie Glu Ala 
15 10 15 

Gin Gin His Leu Leu Gin Leu Ser He Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Lys Val Leu 
35 



<210> 37 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 37 

Asn lie Leu Leu Arg Ala lie Glu Ala Gin Gin His Leu Leu Gin Leu 
15 10 15 

Ser lie Trp Gly He Lys Gin Leu Gin Ala Lys Val Leu Ala lie Glu 
20 25 30 

Arg Tyr Leu Arg Asp Gin 
35 



<210> 38 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 38 

Lys Gly lie Val Gin Gin Gin Asp Asn Leu Leu Arg Ala lie Gin Ala 
1 5 10 m 15 

Gin Gin Gin Leu Leu Arg Leu Ser Xaa Trp Gly lie Arg Gin Leu Arg 
20 25 30 

Ala Arg Leu Leu Ala Leu Glu Thr Leu Leu Gin Asn Gin 
35 40 45 
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<210> 39 
<211> 35 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 39 

Lys Gly lie Val .Gin Gin Gin Asp Asn Leu Leu Arg Ala He Gin Ala 
1 "5 10 15 

Gin Gin Gin Leu Leu Arg Leu Ser Xaa Trp Gly He Arg Gin Leu Arg 
20 25 30 

Ala Arg Leu 
35 



<210> 40 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 40 

Asp Asn Leu Leu Arg Ala He Gin Ala Gin Gin Gin Leu Leu Arg Leu 
15 10 15 

Ser Xaa Trp Gly He Arg Gin Leu Arg Ala Arg Leu Leu Ala Leu Glu 
20 25 30 

Thr Leu Leu Gin Asn Gin 
35 



<210> 41 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 41 

Trp Met Glu Trp Asp Arg Glu lie Asn Asn Tyr Thr Ser Leu He His 
1 4 5 10 15 

Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu 
20 25 30 

Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
35 40 45 



<210> 42 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 42 

Trp Met Glu Trp Glu Arg Glu He Glu Asn Tyr Thr Gly Leu He Tyr 
1 5 .10 15 

Thr Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Asp 
20 25 30 



Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
35 ' 40 45 
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<210> 43 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 43 

Trp Met Glu Trp Glu Arg Glu lie Glu Asn Tyr Thr Gly Leu lie Tyr 
1 5 10 15 

Thr Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Asp 
20 25 30 

Leu Leu 



<210> 44 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 44 

Tyr Thr Gly Leu He Tyr Thr Leu He Glu Glu Ser Gin Asn Gin Gin 
15 10 15 

Glu Lys Asn Glu Gin Asp Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu 
20 25 30 

Trp Asn Trp Phe 
35 



<210> 45 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 45 

Trp Met Glu Trp Glu Arg Glu He Asp Asn Tyr Thr Ser Glu He Tyr 
1 5 10, 15 

Thr Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu 
20 25 30 

Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
35 40 45 



<210> 46 
<211> 34 
<212> PRT . 

<213> Human immunodeficiency virus 
<400> 46 

Trp Met Glu Trp Glu Arg Glu He Asp Asn Tyr Thr Ser Glu He Tyr 
1 5 10 15 

Thr Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu 
20 25 30 



Leu Leu 
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<210> 47 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 47 

Tyr Thr Ser Glu He Tyr Thr Leu He Glu Glu Ser Gin Asn Gin Gin 
1 5 10 15 

Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu 
20 25 30 

Trp Asn Trp Phe 

35 



<210> 48 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 48 

Trp Met Glu Trp Glu Arg Glu He Asp Asn Tyr Thr. Asp Tyr He Tyr 
1.5 10 15 

Asp Leu Leu Glu Lys Ser Gin Thr Gin Gin Glu Lys Asn Glu Lys Glu 
20 25 30 

Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
35 4 0 4 5 



<210> 49 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 49 

Trp Met Glu Trp Glu Arg Glu He Asp Asn Tyr Thr Asp Tyr He Tyr 
1 5 10 15 

Asp Leu Leu Glu Lys Ser Gin Thr Gin Gin Glu Lys Asn Glu Lys Glu 
20 25 30 

Leu Leu 



<210> 50 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 50 

Tyr Thr Asp Tyr He Tyr Asp Leu Leu Glu Lys Ser Gin Thr Gin Gin 
1 5 10 15 

Glu Lys Asn Glu Lys Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu 
20 25 30 



Trp Asn Trp Phe 
35 
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<210> 51 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 51 

Trp lie Gin Trp Asp Arg Glu He Ser Asn Tyr Thr Gly He He Tyr 
15 10 15 

Arg Leu Leu Glu Glu Ser Gin Asn Gin Gin Glu Asn Asn Glu Lys Asp 
20 25 30 

Leu Leu Ala Leu Asp Lys Trp Gin Asn Leu Trp Ser Trp Phe 
35 40 45 



<-210> 52 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 52 

Trp He Gin Trp Asp Arg Glu He Ser Asn Tyr Thr Gly He lie Tyr 
1 5 10 15 

Arg Leu Leu Glu Glu Ser Gin Asn Gin Gin Glu Asn Asn Glu Lys Asp 
20 25 30 

Leu Leu 



<210> 53 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 53 

Tyr Thr Gly He He Tyr Arg Leu Leu Glu Glu Ser Gin Asn Gin Gin 
15 10 15 

Glu Asn Asn Glu Lys Asp Leu Leu Ala Leu Asp Lys Trp Gin Asn Leu 
20 25 30 

Trp Ser Trp Phe 
35 



<210> 54 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 54 

Trp Met Glu Trp Glu Arg Glu He Ser Asn Tyr Thr Gly Leu He Tyr 
15 10 15 

Asp Leu He Glu Glu Ser Gin He Gin Gin Glu Lys Asn Glu Lys Asp 
20 25 30 



Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
35 40 45 
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<210> 55 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 55 

Trp Met Glu Trp Glu Arg Glu lie Ser Asn Tyr Thr Gly Leu lie Tyr 
1 5 10 15 

Asp Leu lie Glu Glu Ser Gin lie Gin Gin Glu Lys Asn Glu Lys Asp 
20 25 30 

Leu Leu 



<210> 56 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 56 

Tyr Thr Gly Leu He Tyr Asp Leu He Glu Glu Ser Gin He Gin Gin 
15 10 15 

Glu Lys Asn Glu Lys Asp Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu 
20 25 30 

Trp Asn Trp Phe 
35 



<210> 57 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 57 

Trp Met Glu Trp Gin Lys Glu He Ser Asn Tyr Ser Asn Glu Val Tyr 
15 10 15 

Arg Leu He Glu Lys Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Gly 
20 25 30 

Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
35 40 45 



<210> 58 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 58 

Trp Met Glu Trp Gin Lys Glu He Ser Asn Tyr Ser Asn Glu Val Tyr 
15 10 15 

Arg Leu He Glu Lys Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Gly 
20 25 30 



Leu Leu 
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<210> 59 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 59 

Tvr Ser Asn Glu Val Tyr Arg Leu lie Glu Lys Ser Gin Asn Gin Gin 
1 5 10 15 

Glu Lys Asn Glu Gin Gly Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu 
20 25 30 

Trp Asn Trp Phe 
35 



<210> 60 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 60 

Trp lie Gin Trp Asp Arg Glu He Ser Asn Tyr Thr Gin Gin He Tyr 
1 5 10 15 

Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Asp 
20 25 30 

Leu Leu Ala Leu Asp Asn Trp Ala Ser Leu Trp Thr Trp Phe 
35 40 45 



<210> 61 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 61 

Trp He Gin Trp Asp Arg Glu He Ser Asn Tyr Thr Gin Gin He Tyr 
IS 10 15 

Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Asp 
20 25 30 

Leu Leu 



<210> 62 
<211> 36 
<212> PRT , 

<213> Human immunodeficiency virus 
<400> 62 

Tyr Thr Gin Gin He Tyr Ser Leu He Glu Glu Ser Gin Asn Gin Gin 
15 10 15 

Glu Lys Asn Glu Gin Asp Leu Leu Ala Leu Asp Asn Trp Ala Ser Leu 
20 25 30 

Trp Thr Trp Phe ' 
35 
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<210> 63 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 

*. 

<400> 63 

Trp Met Glu Trp Asp Arg Gin lie Asp Asn Tyr Thr Glu Val lie Tyr 
1 5 10 15 

Arg Leu Leu Glu Leu Ser Gin Thr Gin Gin Glu Gin Asn Glu Gin Asp 
20 25 . 30 

Leu Leu Ala Leu Asp Lys Trp Asp Ser Leu Trp Asn Trp Phe 
35 40 45 



<210> 64 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 64 

Trp Met Glu Trp Asp Arg Gin lie Asp Asn Tyr Thr Glu Val lie Tyr 
1 5 10 15 

Arg Leu Leu Glu Leu Ser Gin Thr Gin Gin Glu Gin Asn Glu Gin Asp 
20 25 30 

Leu Leu 



<210> 65 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 65 

Tyr Thr Glu Val He Tyr Arg Leu Leu Glu Leu Ser Gin Thr Gin Gin 
15 10 15 

Glu Gin Asn Glu Gin Asp Leu Leu Ala Leu Asp Lys Trp Asp Ser Leu 
20 25 30 

Trp Asn Trp Phe 
35 



<210> 66 
<211> 47 
<212> PRT • 

<213> Human immunodeficiency virus 
<400> 66 

Trp He Gin Trp Glu Arg Glu He Asn Asn Tyr Thr Gly He He Tyr 
1 5 10 15 

Ser Leu He Glu Glu Ala Gin Asn Gin Gin Glu Asn Asn Glu Lys Asp 
20 25 30 

Leu Leu Ala Leu Asp Lys Trp Thr Asn Leu Trp Asn Trp Phe Asn 
35 40 45 
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<210> 67 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 67 

Trp lie Gin Trp Glu Arg Glu He Asn Asn Tyr Thr Gly He He Tyr 
15 10 15 

Ser Leu He Glu Glu Ala Gin Asn Gin Gin Glu Asn Asn Glu Lys Asp 
20 25 30 

Leu Leu 



<210> 68 
<211> 37 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 68 

Tyr Thr Gly He He Tyr Ser Leu He Glu Glu Ala Gin Asn Gin Gin 
15 10 15 

Glu Asn Asn Glu Lys Asp Leu Leu Ala Leu Asp Lys Trp Thr Asn Leu 
20 25 30 

Trp Asn Trp Phe Asn 
35 



<210> 69 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 69 

Trp Gin Gin Trp Asp Glu Lys Val Arg Asn Tyr Ser Gly Val He Phe 
1 1 5 10 15 

Gly Leu He Glu Gin Ala Gin Glu Gin Gin Asn Thr Asn Glu Lys Ser 
20 25 30 

Leu Leu Glu Leu Asp Gin Trp Asp Ser Leu Trp Ser Trp Phe 
35 40 45 



<210> 70 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 70 

Trp Gin Gin Trp Asp Glu Lys Val Arg Asn Tyr Ser Gly Val He Phe 
15 10 15 

Gly Leu He Glu Gin Ala Gin Glu Gin Gin Asn Thr Asn Glu Lys Ser 
20 25 30 



Leu Leu 
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<210> 71 
<211> 36 
<212> PRT 

<213> Human imnunodef iciency virus 
<400> 71 

Tyr Ser Gly Val He Phe Gly Leu He Glu Gin Ala Gin Glu Gin Gin 
1 5 ^ 10 15 

Asn Thr Asn Glu Lys Ser Leu Leu Glu Leu Asp Gin Trp Asp Ser Leu 
20 25 30 

Trp Ser Trp Phe 
35 



<210> 72 

<211> 46 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 72 

Trp Gin Glu Trp Asp Arg Gin He Ser Asn He Ser Ser Thr He Tyr 
1 5 10 15 

Glu Glu He Gin Lys Ala Gin Val Gin Gin Glu Gin Asn Glu Lys Lys 
20 25 30 

Leu Leu Glu Leu Asp Glu Trp Ala Ser lie Trp Asn Trp Leu 
35 40 45 



<210> 73 
<2H> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 73 

Trp Gin Glu Tro Asp Arg Gin He Ser Asn He Ser Ser Thr lie Tyr 
15 10 15 

Glu Glu He Gin Lys Ala Gin Val Gin Gin Glu Gin Asn Glu Lys Lys 
2G 25 30 

Leu Leu 



<210> 74 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 74 

He Ser Ser Thr He Tyr Glu Glu He Gin Lys Ala Gin Val Gin Gin 
15 10 15 

Glu Gin Asn Glu Lys Lys Leu Leu Glu Leu Asp Glu Trp Ala Ser He 
20 25 30 

Trp Asn Trp Leu 

. 35 
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<210> 75 
<211> 465 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (34) . . (357) 

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 75 

agcggtgcgc cgaaagtacg cgctaagctt cat atg ggt att gtt cag cag cag 54 

Met Gly He Val Gin Gin Gin 
1 5 

aac aat ttg ctg agg get att gag gcg caa cag cac ctg ctg cag ctg 102 
Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 
10 15 20 

acc gta tgg ggc ate aag cag ctg cag gca cgc ate ctg get gtt gaa 150 
Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg He Leu Ala Val Glu 
25 30 35 

cgc tac ctg aag gat caa ggc ggc ggc tea ggc gec ggc tea gag tgg 198 
Arg Tyr Leu Lys Asp Gin Gly Gly Gly Ser Gly Ala Gly Ser Glu Trp 
40 45 50 55 

gac aga gaa att aac aat tac aca age tta ata cac tec tta att gaa 246 
Asp Arg Glu He Asn Asn Tyr Thr Ser Leu He His Ser Leu lie Glu 
60 65 70 

gaa teg caa aac cag caa gaa aag aat gaa caa gaa tta ttg gaa tta 294 
Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu 
75 80 85 

gat aaa tgg gca agt ttg tgg aat tgg ttt gaa ttc ate gat gat ate 342 
Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Glu Phe He Asp Asp He 
90 95 100 

aga tec ggc tgc taa caaagcccga aaggaagctg agtttggctg ctgccacccg 397 
Arg Ser Gly Cys 

105 ., 

ctgagcaata actagcataa ccccttgggg gcctctaaac gggtcttgag gggttttttg 457 
cttgaaag 465 

<210> 76 
<211> 107 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Synthetic 
<400> 76 

Met Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 

1 5 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 

20 25 30 

Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin Gly Gly Gly 
35 40 45 
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Ser Gly Ala 
50 



Gly Ser Glu Trp 
55 

Ser Leu He Glu 
70 

Leu Leu Glu Leu 
85 



Asp Arg Glu He Asn Asn Tyr Thr Ser 



60 



Leu He His 
65 



Glu Ser Gin Asn Gin Gin Glu Lys Asn 



75 80 



Glu Gin Glu 



Asp Lys Trp Ala Ser Leu Trp Asn Trp 



90 95 



Phe Glu Phe 



He Asp Asp He 
100 



Arg Ser Gly Cys 



105 



<210> 77 
<211> 197 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 77 

gagggactat atccggttat tcacaaggac ggctgtgggc gccatgatcg cgtagtcgat 60 
agtggctcca agtaaccgga agcgacaggg actgtgccgg gcgccaaagg cggtcgacag 120 
tgctttctag aaccgggtgc gcataaaaat gcatcacgcc tatagcgcta gagccgctgc 180 
attaaatgaa tcggcca 197 



<210> 78 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 78 

Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn Gin Gin 
15 10 15 

Glu Lys 



<210> 79 
i <211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 79 

Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu 
15 10 15 

Leu Asp 



<210> 80 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 80 

Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
1 5 10 



<210> 81 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 81 

gggcccatat gggtattgtt cagcag 



<210> 82 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 82 

gggccggcgc ctgagccgcc gccttgatcc ttcaggtagc gttc 



<210> 83 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 83 

gggccggcgc cggctcagag tgggacagag aaattaacaa ttac 



<210> 84 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 84 

gggccgaatt cttaaaacca attccacaaa cttgcccatt t 
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